OFDM: From Foundations to 6G
A Complete Study Notebook — 4G LTE · 5G NR · 6G Horizon
This notebook provides a rigorous end-to-end treatment of OFDM — from DFT/IDFT mathematical foundations through CP-OFDM, DFT-s-OFDM, multipath resilience, synchronization, channel estimation, LTE/5G NR air interface design, and the 6G outlook. Covers FFT/IFFT signal processing fundamentals, PAPR, frequency/timing offsets, flexible 5G numerology, and OFDM variants (FBMC, OTFS, f-OFDM), alongside a comprehensive pros/cons analysis. Each section builds systematically on the last, balancing mathematical rigour with practical implementation insight drawn from 3GPP specifications. Embedded interactive charts and worked numerical examples reinforce every major concept for active, hands-on learning.
| Section | Section |
|---|---|
| §1 Mathematical Foundations (DFT/IDFT/FFT/IFFT) | §7 LTE (4G) OFDM & SC-FDMA |
| §2 CP-OFDM Signal Model & Spectrum | §8 5G NR Flexible Numerology |
| §3 Cyclic Prefix & Multipath Robustness | §9 OFDM Variants (FBMC/OTFS/f-OFDM) |
| §4 PAPR Problem & Reduction Techniques | §10 Pros & Cons Analysis |
| §5 Frequency & Timing Synchronization | §11 6G Outlook (THz / AI / ISAC) |
| §6 Channel Estimation & Equalization | — |
Core: Linear algebra (FFT, matrix operations), probability & statistics, basic DSP (sampling, convolution, z-transforms).
Recommended: 5G NR frame structure before §7 onward. Familiarity with complex baseband notation helpful throughout.
- Sequential or jump reading: each section is self-contained; cross-references are provided for dependencies.
- Offline mathematics: all equations render without an internet connection.
- Interactive charts: hover, zoom, and toggle traces to explore signal and spectral plots.
- Study questions: appear at each section end — attempt them before checking the worked solutions.
Mathematical Foundations — DFT, IDFT, FFT, IFFT
1.1 From Fourier Series to the Discrete Fourier Transform
The journey toward the DFT begins with the observation that any sufficiently well-behaved periodic function can be decomposed into a weighted sum of harmonically related sinusoids — a fact first systematized by Joseph Fourier in 1822. Understanding the conceptual chain Fourier Series → Fourier Transform → DTFT → DFT shows that the DFT is not an ad-hoc engineering trick but the natural endpoint of discretizing and periodizing the continuous transform.
1.1.1 Fourier Series (continuous-time, periodic)
Let $x(t)$ be $T_0$-periodic. The Fourier Series (FS) representation is:
where the coefficients $c_k$ are obtained by the analysis integral:
The set $\{e^{j 2\pi k f_0 t}\}_{k \in \mathbb{Z}}$ forms a complete orthonormal basis for $L^2([0,T_0])$:
The frequency spectrum is discrete (harmonics at $kf_0$) but the time signal is continuous.
1.1.2 Continuous-Time Fourier Transform (CTFT)
When $T_0 \to \infty$ (the signal is no longer constrained to be periodic, or equivalently the fundamental frequency $f_0 \to df$ becomes infinitesimal), the harmonic sum becomes an integral — the Fourier Transform:
Both time and frequency are now continuous. Infinite support in time implies a continuous frequency spectrum. The spectrum is generally complex-valued; $|X(f)|$ is the amplitude spectrum and $\angle X(f)$ is the phase spectrum.
1.1.3 Discrete-Time Fourier Transform (DTFT)
Practical digital systems work with sequences $x[n]$ obtained by sampling $x(t)$ at interval $T_s$ (rate $f_s = 1/T_s$). The DTFT maps a discrete-time sequence to a continuous, periodic frequency function:
The DTFT is $2\pi$-periodic in $\omega$ (equivalently, $f_s$-periodic in Hz). It is still a continuous function of $\omega$ and requires an infinite sequence — not computable directly on a machine.
1.1.4 Arriving at the DFT
If we additionally limit the sequence to $N$ samples and evaluate the DTFT at $N$ equally-spaced frequencies $\omega_k = 2\pi k / N$ for $k = 0, 1, \ldots, N-1$, we obtain the Discrete Fourier Transform:
Both the time-domain sequence $x[n]$ and the frequency-domain sequence $X[k]$ are now finite and discrete — perfectly suited for digital computation. The DFT implicitly assumes both sequences are periodic with period $N$.
| Transform | Time domain | Frequency domain | Computable? |
|---|---|---|---|
| Fourier Series | Continuous, periodic | Discrete, aperiodic | Analytically |
| CTFT | Continuous, aperiodic | Continuous, aperiodic | Analytically |
| DTFT | Discrete, aperiodic | Continuous, periodic | No (infinite sum) |
| DFT | Discrete, periodic (N) | Discrete, periodic (N) | Yes — O(N²) or O(N log N) |
1.2 DFT and IDFT — Formal Definitions
Let $x[n]$, $n = 0, 1, \ldots, N-1$, be a finite sequence of (possibly complex) numbers. Define the N-th root of unity twiddle factor:
so that $W_N^{kn} = e^{-j 2\pi kn/N}$. The DFT pair is then written compactly as:
Physical interpretation of DFT bins:
- Bin $k=0$: DC component — $X[0] = \sum_n x[n]$ (sum of all samples).
- Bin $k$: complex amplitude of the sinusoid at normalized frequency $f_k = k/N$ cycles per sample, or in Hz: $f_k = k f_s / N$ where $f_s$ is the sample rate.
- Bins $k = N/2+1, \ldots, N-1$ correspond to negative frequencies $f = (k-N)/N$ (aliased by periodicity); for real $x[n]$, these are complex conjugates of bins $1$ through $N/2-1$.
- Bin $k = N/2$: the Nyquist frequency $f_s/2$.
numpy.fft.fft/numpy.fft.ifft).
Some mathematics texts place $1/\sqrt{N}$ on both transforms for a unitary operator.
Always verify which convention applies before comparing results across sources.
1.3 Orthogonality of Complex Exponentials — Proof
The correctness of the IDFT formula — that it inverts the DFT exactly — rests entirely on the discrete orthogonality of the complex exponential basis functions. We now prove this fundamental identity.
Proof. Let $r = k - m$. Consider two cases:
Case 1: $r \equiv 0 \pmod{N}$ (i.e., $k=m$ mod $N$).
Then $e^{j 2\pi r n/N} = e^{0} = 1$ for all $n$. The sum equals $\sum_{n=0}^{N-1} 1 = N$. ■
Case 2: $r \not\equiv 0 \pmod{N}$.
Let $\alpha = e^{j 2\pi r/N}$. Since $r$ is not a multiple of $N$, $\alpha \neq 1$. The sum is a finite geometric series:
Now $\alpha^N = e^{j 2\pi r N/N} = e^{j 2\pi r} = 1$ (since $r$ is an integer and Euler's formula gives $e^{j 2\pi r} = \cos(2\pi r) + j\sin(2\pi r) = 1$). Therefore the numerator $1 - \alpha^N = 1 - 1 = 0$, while the denominator $1 - \alpha \neq 0$. Hence $S = 0$. ■
Consequence — IDFT inverts DFT. Start from the DFT:
Substitute into the IDFT candidate formula and exchange the order of summation:
The IDFT exactly recovers $x[n]$ for each $n = m$. ■
1.4 DFT as Matrix Multiplication
Writing the DFT in full for all $k$ simultaneously reveals that it is simply a linear transformation representable by a single DFT matrix $\mathbf{W} \in \mathbb{C}^{N \times N}$.
where the vectors are $\mathbf{x} = [x[0], x[1], \ldots, x[N-1]]^T$ and $\mathbf{X} = [X[0], X[1], \ldots, X[N-1]]^T$, and the $(k, n)$ entry of $\mathbf{W}$ (using 0-based indexing) is:
For $N = 4$, the DFT matrix expands explicitly as (using $j = \sqrt{-1}$):
Note $W_4^0 = 1$, $W_4^1 = e^{-j\pi/2} = -j$, $W_4^2 = e^{-j\pi} = -1$, $W_4^3 = e^{-j3\pi/2} = j$. Each row $k$ of $\mathbf{W}_N$ is a complex sinusoid at frequency $k/N$ cycles/sample. The DFT is the inner product of the input vector with each row — i.e., the projection of $x[n]$ onto each basis sinusoid.
Inverse DFT matrix:
where $\mathbf{W}^H$ denotes the conjugate transpose. This follows directly from the orthogonality theorem: $\mathbf{W}^H \mathbf{W} = N \mathbf{I}$, so $\mathbf{W}^{-1} = \frac{1}{N}\mathbf{W}^H$.
Computational cost of direct matrix multiplication: Each of the $N$ output bins requires $N$ complex multiplications and $N-1$ complex additions. Total: $O(N^2)$ complex multiplications. For $N = 1024$: roughly $10^6$ multiplications. For $N = 4096$: $\sim 1.7 \times 10^7$. This is why direct computation becomes prohibitive for large $N$, motivating the FFT.
1.5 The FFT Algorithm — Cooley-Tukey Radix-2 DIT
The Fast Fourier Transform (FFT) is not a different mathematical transform — it computes the exact same DFT, but exploits the special structure of the DFT matrix $\mathbf{W}$ to reduce the arithmetic to $O(N \log_2 N)$ operations. The most common variant is the Cooley-Tukey radix-2 decimation-in-time (DIT) algorithm, published in 1965 (though the core idea was known to Gauss in 1805).
1.5.1 Divide and Conquer — The Danielson-Lanczos Lemma
Assume $N = 2^m$ (a power of 2). Split the input sequence into two $N/2$-point sub-sequences — the even-indexed samples and the odd-indexed samples:
The DFT can be rewritten as:
Using $W_N^{2rk} = e^{-j 2\pi (2r)k/N} = e^{-j 2\pi rk/(N/2)} = W_{N/2}^{rk}$:
where $E[k]$ and $O[k]$ are the $N/2$-point DFTs of the even and odd sub-sequences, respectively. The factor $W_N^k$ is called the twiddle factor.
1.5.2 The Butterfly and Periodicity
Since $E[k]$ and $O[k]$ are $N/2$-periodic, we have:
and $W_N^{k+N/2} = W_N^k \cdot W_N^{N/2} = -W_N^k$. Therefore:
These two equations computed together for each $k = 0, \ldots, N/2 - 1$ form a butterfly operation: two inputs $(E[k], O[k])$, one complex multiplication by $W_N^k$, one addition, one subtraction — yielding two outputs.
1.5.3 Bit-Reversal Permutation
The DIT FFT requires the input to be in bit-reversed order. For $N=8$, the natural indices $0{-}7$ in binary are $000, 001, 010, 011, 100, 101, 110, 111$. Reversing the 3-bit binary representations gives: $000, 100, 010, 110, 001, 101, 011, 111$ → indices $0, 4, 2, 6, 1, 5, 3, 7$. The butterfly structure then processes interleaved sub-sequences at each stage.
1.5.4 Complexity Analysis
| $N$ (FFT size) | Direct DFT ($N^2$ mults) | FFT ($\frac{N}{2}\log_2 N$ mults) | Speedup factor |
|---|---|---|---|
| 64 | 4,096 | 192 | 21× |
| 256 | 65,536 | 1,024 | 64× |
| 1,024 | 1,048,576 | 5,120 | 205× |
| 4,096 | 16,777,216 | 24,576 | 683× |
| 65,536 | $4.3 \times 10^9$ | 524,288 | 8,192× |
The speedup grows without bound as $N$ increases. For 5G NR OFDM with $N = 4096$ (FR1 maximum), the FFT is $\sim 683\times$ faster than direct computation — making real-time OFDM processing practical on embedded hardware.
1.5.5 Why $N = 2^m$ is Preferred
The radix-2 Cooley-Tukey algorithm applies only when $N$ is a power of 2, enabling $\log_2 N$ recursion levels each with $N/2$ butterflies. For other $N$:
- Radix-4, radix-8 FFT: $N = 4^m$ or $8^m$ — even fewer multiplications due to larger butterflies (radix-4 saves $\sim 25\%$ mults vs radix-2).
- Mixed-radix FFT: $N = p \cdot q$ (composite) — split into $p$-point and $q$-point sub-DFTs. FFTW uses this for arbitrary $N$.
- Prime-length FFT: Rader's algorithm or Bluestein's chirp-z transform for prime $N$. Much less efficient.
- 5G NR practice: FFT sizes are always powers of 2 (128 to 4096), chosen to match the subcarrier-count requirement while enabling efficient radix-2/4/8 FFT implementations in hardware (FPGAs, ASICs).
1.6 IFFT — The Inverse FFT and Its Role in OFDM
The IDFT is:
Observe that this is the same computational structure as the DFT, but with the twiddle factors conjugated ($e^{+j\ldots}$ vs $e^{-j\ldots}$) and with a $1/N$ scaling. The IFFT is therefore computed by:
- Conjugate the input: $X^*[k]$
- Apply the standard FFT to get $Y[n] = \text{FFT}(X^*[k])$
- Conjugate the output and scale: $x[n] = Y^*[n] / N$
Alternatively, IFFT = complex-conjugate the twiddle factors in the FFT butterfly and divide by $N$ at the end. Most hardware and software implement the FFT and IFFT with a shared butterfly core, differing only in the sign of the twiddle factor angle and the scaling.
1.7 Key Properties of the DFT / FFT
All properties below are stated for the $N$-point DFT and use the notation $x[n] \xleftrightarrow{\text{DFT}} X[k]$. Index arithmetic is modulo $N$.
Property 1 — Linearity
The DFT is a linear operator. This follows immediately from the matrix form $\mathbf{W}(a\mathbf{x}_1 + b\mathbf{x}_2) = a\mathbf{W}\mathbf{x}_1 + b\mathbf{W}\mathbf{x}_2$. Consequence: OFDM subcarriers are modulated and demodulated independently using superposition.
Property 2 — Time-Shift $\Rightarrow$ Phase Rotation in Frequency
A circular shift of $n_0$ samples in time multiplies every DFT bin by a linear phase ramp $e^{-j 2\pi k n_0/N}$. The magnitude spectrum $|X[k]|$ is unchanged — only the phase rotates. In OFDM, the cyclic prefix exploits this property: a channel delay $\tau$ introduces a phase shift $e^{-j 2\pi k \tau/N}$ per subcarrier, which is corrected by a per-subcarrier complex division (one-tap equalizer) after the FFT at the receiver.
Property 3 — Frequency-Shift $\Rightarrow$ Phase Rotation in Time (Modulation)
Multiplying by a complex exponential in time circularly shifts the spectrum. This is the discrete-time analog of the AM modulation theorem and is used in frequency-domain equalization and carrier offset estimation.
Property 4 — Circular Convolution Theorem (most important for OFDM)
where $\circledast$ denotes circular (cyclic) convolution of length $N$:
Key consequence for OFDM: A multipath channel of length $L$ (impulse response $h[n]$) performs linear convolution on the transmitted signal. By appending a cyclic prefix of length $\geq L-1$ to each OFDM symbol, the linear convolution becomes effectively circular over the FFT window. The FFT at the receiver then converts this circular convolution to a pointwise multiplication: $Y[k] = H[k] \cdot X[k]$. Channel equalization reduces to $N$ independent scalar divisions — one per subcarrier — instead of matrix inversion. This is the core computational advantage of OFDM in frequency-selective channels.
Property 5 — Parseval's Theorem (Energy Conservation)
The total energy of the time-domain signal equals the total energy of the frequency-domain signal divided by $N$ (due to the $1/N$ convention). In the unitary DFT convention, the $1/N$ factor is absent and energy is exactly preserved: $\|\tilde{\mathbf{W}}\mathbf{x}\|^2 = \|\mathbf{x}\|^2$. Parseval's theorem is used in OFDM PAPR (Peak-to-Average Power Ratio) analysis and signal-to-noise ratio calculations.
Property 6 — Conjugate Symmetry (Real Inputs)
If $x[n]$ is real-valued, then:
The DFT is Hermitian symmetric: bins $k$ and $N-k$ are complex conjugates. Only the first $N/2 + 1$ bins are independent. This is exploited in the real FFT (rfft) which computes only bins $0$ through $N/2$ in roughly half the operations of a complex FFT. In 5G NR downlink, the OFDM waveform is real-valued after RF upconversion (mixing with carrier), though the baseband model uses complex I+Q representation where this symmetry does not apply.
Property 7 — Circular Shift in Frequency → Multiplication in Time
| Property | Time domain | Frequency domain |
|---|---|---|
| Linearity | $ax_1[n]+bx_2[n]$ | $aX_1[k]+bX_2[k]$ |
| Time shift | $x[\langle n-n_0\rangle_N]$ | $e^{-j2\pi kn_0/N}X[k]$ |
| Freq shift | $e^{j2\pi k_0 n/N}x[n]$ | $X[\langle k-k_0\rangle_N]$ |
| Circ. conv. | $x_1\circledast x_2$ | $X_1[k]\cdot X_2[k]$ |
| Parseval | $\sum|x[n]|^2$ | $\frac{1}{N}\sum|X[k]|^2$ |
| Conjugate sym. | $x[n]$ real | $X[N-k]=X^*[k]$ |
| Time reversal | $x[\langle{-n}\rangle_N]$ | $X[\langle{-k}\rangle_N]$ |
1.8 OFDM Connection — How IFFT Creates and FFT Demodulates Subcarriers
OFDM (Orthogonal Frequency-Division Multiplexing) is a direct engineering application of the IDFT/DFT pair. The insight is elegant: the IDFT formula for OFDM modulation and the DFT formula for demodulation are computationally identical to the IFFT/FFT algorithms already described.
1.8.1 OFDM Transmitter (IFFT-based)
Given $N$ complex data symbols $\{X[k]\}_{k=0}^{N-1}$ (one per subcarrier, e.g., QAM points), the continuous-time OFDM signal over one symbol period $T = N \cdot T_s$ would ideally be:
where $\Delta f = 1/T$ is the subcarrier spacing (in Hz). At the $n$-th sample time $t = nT_s = n/(N\Delta f)$:
This is exactly the IDFT (up to the $1/N$ normalization). The IFFT simultaneously generates all $N$ subcarrier sinusoids and sums them — creating the composite time-domain waveform in a single $O(N \log N)$ operation.
1.8.2 Cyclic Prefix Insertion
After the IFFT, the last $N_{CP}$ samples of $s[n]$ are prepended to the symbol:
followed by the original $N$ samples. Total transmitted length: $N + N_{CP}$. This converts the linear channel convolution into circular convolution (as explained in §1.7, Property 4), enabling simple one-tap frequency-domain equalization.
1.8.3 OFDM Receiver (FFT-based)
After removing the cyclic prefix, the received signal (assuming AWGN channel with frequency-selective fading $H[k]$) in the frequency domain is:
where $H[k] = \text{FFT}\{h[n]\}$ is the channel frequency response at subcarrier $k$ and $Z[k]$ is the additive noise. The FFT at the receiver demodulates all $N$ subcarriers simultaneously in $O(N \log N)$ operations.
Equalization then recovers the data symbol on each subcarrier independently:
- Without OFDM: a wideband channel with $L$ taps requires a complex equalizer with $\sim L$ taps (up to hundreds of coefficients in mobile channels) operating at the full sample rate.
- With OFDM + FFT: the FFT converts the wideband problem into $N$ independent narrowband problems. Each subcarrier sees a flat fading channel (single complex coefficient). Equalization costs $N$ complex divisions — independent of $L$.
- The FFT converts $O(N \cdot L)$ equalization complexity to $O(N \log N + N)$.
- For 5G NR with $N = 4096$ subcarriers and channel delay spread $L \approx 100$ taps, this is a $\sim 28\times$ reduction in computational complexity, on top of the modulation efficiency from packing $N$ symbols per FFT window.
1.9 Numerical Example — $N = 8$, Rectangular Pulse
Let $x[n] = 1$ for $n = 0, 1, 2, 3$ and $x[n] = 0$ for $n = 4, 5, 6, 7$ (a length-4 rectangular window in an $N = 8$ DFT). Compute $X[k]$ analytically.
This is a geometric series with ratio $\alpha = e^{-j\pi k/4}$:
using the identity $\sum_{n=0}^{M-1} \alpha^n = e^{j(M-1)\phi/2} \cdot \sin(M\phi/2)/\sin(\phi/2)$ where $\phi = -\pi k/4$.
| $k$ | $X[k]$ (exact) | $|X[k]|$ | $\angle X[k]$ (rad) |
|---|---|---|---|
| 0 | $4 + 0j$ | 4.000 | 0 |
| 1 | $1 - 2.414j$ | 2.613 | $-1.176$ |
| 2 | $0 - 1j$ | 1.000 | $-\pi/2$ |
| 3 | $1 - 0.414j$ | 1.082 | $-0.392$ |
| 4 | $0 + 0j$ | 0.000 | — |
| 5 | $1 + 0.414j$ | 1.082 | $+0.392$ |
| 6 | $0 + 1j$ | 1.000 | $+\pi/2$ |
| 7 | $1 + 2.414j$ | 2.613 | $+1.176$ |
Observations: (1) $X[0] = 4$ is the sum of all samples (DC component). (2) $X[4] = 0$ exactly — the Nyquist bin is zero because the rectangular pulse has a null at $f = f_s/2$. (3) The spectrum is Hermitian: $X[k] = X^*[N-k]$ since $x[n]$ is real. (4) The $|X[k]|$ envelope follows a Dirichlet kernel (discrete sinc) shape.
1.10 Interactive DFT Magnitude Spectrum
The chart below shows the DFT of $x[n] = \cos\!\left(\frac{2\pi \cdot 3 \cdot n}{N}\right) + 0.5\cos\!\left(\frac{2\pi \cdot 1 \cdot n}{N}\right)$ for $N = 16$ samples. Analytically, we expect spectral peaks at bins $k = 1, 3$ (positive frequencies) and $k = 13, 15$ (corresponding negative frequencies), with magnitudes $N/2 = 8$ for the $k=3$ tone and $N/4 = 4$ for the $k=1$ tone.
1.11 Spectral Leakage, the Dirichlet Kernel, and Windowing
The DFT implicitly assumes the $N$-sample block is one period of a periodic signal. When the signal is not an integer number of cycles within the DFT window (i.e., the frequency does not fall exactly on a bin), energy "leaks" into adjacent bins — a phenomenon called spectral leakage.
The mathematical origin: the DFT of a finite-length signal is equivalent to the DTFT of the signal multiplied by a rectangular window $w[n] = 1$ for $0 \leq n \leq N-1$. In frequency, this multiplication becomes circular convolution of the ideal spectrum with the DFT of the rectangle — the Dirichlet kernel:
$|D_N|$ has a main lobe of width $4\pi/N$ and sidelobes at approximately $-13\,\text{dB}$ below the main lobe peak. Spectral leakage is the convolution of any sharp spectral feature with these sidelobes.
Windowing applies a time-domain taper $w[n]$ to the signal before the DFT, trading main-lobe width (frequency resolution) for reduced sidelobe level (reduced leakage):
| Window | Main lobe width | Peak sidelobe (dB) | Use case |
|---|---|---|---|
| Rectangular | $4\pi/N$ | $-13$ | Best resolution, worst leakage |
| Hann (Hanning) | $8\pi/N$ | $-31.5$ | General-purpose audio/spectral |
| Hamming | $8\pi/N$ | $-42.7$ | Filter design |
| Blackman | $12\pi/N$ | $-58.1$ | Low leakage needed |
| Kaiser ($\beta=8$) | Variable | $-69$ | Flexible tradeoff |
1.12 Zero-Padding and DFT Interpolation
Appending $M$ zeros to $x[n]$ before computing an $(N+M)$-point DFT does not add new information but produces a denser sampling of the DTFT — effectively interpolating the frequency spectrum:
The frequency resolution (bin spacing) becomes $f_s/(N+M)$ instead of $f_s/N$. Zero-padding is used to: (1) make $N$ a power of 2 for the FFT, (2) improve visual interpolation of spectra, (3) compute circular cross-correlation via the overlap-add method for linear convolution.
- DFT inversion proof. Starting only from the DFT definition $X[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j2\pi kn/N}$ and the discrete orthogonality property, prove that the IDFT formula $x[m] = \frac{1}{N}\sum_{k=0}^{N-1} X[k]\, e^{j2\pi km/N}$ recovers every sample $x[m]$ exactly. Identify the step where the geometric series formula is applied, and explain why $r \not\equiv 0 \pmod{N}$ is required for the sum to vanish.
- FFT complexity and OFDM feasibility. A 5G NR base station processes $K = 100$ OFDM symbols per 1 ms subframe at $N_{FFT} = 4096$. (a) Compute the total number of complex multiplications required per subframe using (i) direct DFT and (ii) radix-2 FFT ($\frac{N}{2}\log_2 N$ multiplications per transform). (b) If each complex multiplication takes 10 ns on a DSP, estimate the processing time for each approach. (c) Explain why the cyclic prefix — even though it wastes bandwidth — is nevertheless essential for the one-tap equalizer derivation based on the circular convolution theorem.
- Subcarrier orthogonality and ICI. In OFDM, a carrier frequency offset (CFO) of $\epsilon$ (in units of subcarrier spacing $\Delta f$) causes the subcarrier spacing $\Delta f$ to shift to $(1+\epsilon)\Delta f$ for subcarrier $k$. Using the discrete orthogonality sum, show that for $\epsilon \neq 0$ the inner product between subcarrier $k$ and subcarrier $m \neq k$ over the DFT window is no longer zero — i.e., inter-carrier interference (ICI) is introduced. Sketch how the ICI power depends on $\epsilon$ for small offsets and state the consequence for OFDM synchronization requirements.
CP-OFDM Signal Model & Spectrum
2.1 CP-OFDM Baseband Signal Model
The continuous-time CP-OFDM baseband signal for a single OFDM symbol is the superposition of N complex exponentials, one per subcarrier:
| Symbol | Meaning | Typical value (5G NR, μ=1) |
|---|---|---|
| $$N$$ | Number of active subcarriers | up to 3276 (FR1, 100 MHz) |
| $$S_k \in \mathbb{C}$$ | QAM symbol on subcarrier k | QPSK / 16-QAM / 256-QAM |
| $$\Delta f$$ | Subcarrier spacing (SCS) | 30 kHz |
| $$T_u = 1/\Delta f$$ | Useful OFDM symbol duration | 33.33 µs |
2.2 Multicarrier Structure — N Parallel Narrow-Band Channels
OFDM is a multicarrier modulation scheme. Rather than transmitting one symbol per use of the channel at the full bandwidth B (as in single-carrier BPSK/QAM), OFDM divides B into N narrow sub-bands, each of width ≈ Δf, and transmits one QAM symbol per sub-band simultaneously:
Frequency →
┌────────┬────────┬────────┬────────┬────────┬────────┬────────┬────────┐
│ S₀ │ S₁ │ S₂ │ S₃ │ S₄ │ S₅ │ S₆ │ S₇ │ ← N=8 subcarriers
│ f=0 │ f=Δf │ f=2Δf │ f=3Δf │ f=4Δf │ f=5Δf │ f=6Δf │ f=7Δf │
└────────┴────────┴────────┴────────┴────────┴────────┴────────┴────────┘
←————————————————— bandwidth B = N·Δf ——————————————————→
Each sub-band sees an effectively flat channel (if Δf ≪ coherence BW)
→ single complex multiply suffices for equalization per subcarrier
The serial-to-parallel (S/P) converter at the transmitter maps a stream of QAM symbols onto the N frequency-domain inputs of the IFFT. After the IFFT and DAC, all N tones are present in the transmitted signal simultaneously — the channel sees a wideband signal, but the DSP treats it as N independent narrowband channels.
2.3 Orthogonality Proof
The fundamental reason CP-OFDM works without inter-subcarrier interference (ICI) is that the set of complex exponentials \(\{\phi_k(t) = e^{j2\pi k \Delta f t}\}\) forms an orthogonal basis over the interval \([0, T_u]\).
Case 1: m = n
Case 2: m ≠ n (let \(p = m - n \in \mathbb{Z} \setminus \{0\}\))
Substituting \(\Delta f \cdot T_u = 1\):
Combining both cases with the Kronecker delta \(\delta_{mn}\):
2.4 Subcarrier Spacing & Symbol Duration — 5G NR Numerologies
The fundamental constraint is the time-frequency uncertainty relation for OFDM:
5G NR defines a set of numerologies indexed by \(\mu \in \{0,1,2,3,4\}\), where \(\Delta f = 2^\mu \times 15\,\text{kHz}\). Higher \(\mu\) gives larger spacing (shorter symbols), trading time resolution for Doppler robustness.
| Numerology μ | \(\Delta f\) (kHz) | \(T_u\) (µs) | CP (normal, µs) | \(T_s = T_u + T_{CP}\) (µs) | Use case |
|---|---|---|---|---|---|
| 0 | 15 | 66.67 | 4.69 | 71.35 | FR1 sub-6 GHz, eMBB |
| 1 | 30 | 33.33 | 2.34 | 35.68 | FR1 sub-6 GHz, eMBB/URLLC |
| 2 | 60 | 16.67 | 1.17 | 17.84 | FR1 (no SRS) / FR2 mmWave |
| 3 | 120 | 8.33 | 0.59 | 8.92 | FR2 mmWave |
| 4 | 240 | 4.17 | 0.29 | 4.46 | FR2 reference signal only |
2.5 OFDM Symbol with Cyclic Prefix (CP)
After the IFFT produces N time-domain samples \(\{s_0, s_1, \ldots, s_{N-1}\}\), a cyclic prefix of length \(N_{CP}\) samples is prepended:
↑ last N_CP samples
CP symbol: [ s_{N-N_CP} … s_{N-1} | s₀ s₁ s₂ … s_{N-1} ]
←——— CP ———→ ←——————— useful part ——————→
N_CP samples N samples
←————————— total: N + N_CP samples ——————————→
The total symbol duration is:
where \(f_s = N \cdot \Delta f\) is the sampling rate.
Let the channel impulse response be \(h[n]\), length \(L_{h} \le N_{CP} + 1\). The received signal (ignoring noise) is the linear convolution:
\(r[n] = (h * x)[n] = \sum_{l=0}^{L_h-1} h[l]\,x[n-l]\)
At the receiver, after removing the CP (discarding the first \(N_{CP}\) samples), the remaining \(N\) samples satisfy:
\(r[n] = \sum_{l=0}^{L_h-1} h[l]\,x[(n-l) \bmod N]\) for \(n = 0, \ldots, N-1\)
This is the circular convolution \(r = h \circledast x\) of length \(N\). Taking the DFT of both sides:
\(R[k] = H[k] \cdot S[k]\)
The channel acts as a scalar multiplier per subcarrier — one complex divide for equalization. This is the entire reason OFDM is efficient in multipath channels.
2.6 Discrete-Time CP-OFDM — the IFFT Implementation
In practice, the continuous-time model is implemented entirely in discrete time. The N-point IFFT of the frequency-domain symbols \(\{S_0, \ldots, S_{N-1}\}\) yields the time-domain samples:
This is the N-point IDFT. The factor \(1/N\) is an implementation convention (some implementations absorb it into the FFT scaling). Comparing to the continuous-time model with \(t = n T_s / N = n / f_s\):
The discrete and continuous models are identical at the sample points.
↓
[S/P Converter] → N parallel symbols
↓
[N-point IFFT] → N time samples {s_n}
↓
[Add CP: prepend last N_CP samples]
↓
[P/S Converter] → serial stream (N + N_CP samples)
↓
[DAC + RF Upconverter] → transmitted signal x(t)
Receiver (reverse path):
RF → ADC → Remove CP (discard first N_CP samples) → N-point FFT → divide by H[k] → QAM demapper → bits
The complexity advantage over a bank of N matched filters is enormous: \(\mathcal{O}(N \log_2 N)\) for the FFT versus \(\mathcal{O}(N^2)\) for direct DFT computation. For \(N = 4096\) (5G NR, 100 MHz FR1), this is a factor of \(\sim\!341\times\) reduction in multiply-accumulate operations.
2.7 CP-OFDM Spectrum & Spectral Efficiency
Each subcarrier k, viewed in isolation, is a finite-duration complex tone of duration \(T_u\). Its Fourier transform is a sinc function:
where \(\operatorname{sinc}(x) = \sin(\pi x)/(\pi x)\). The main lobe of subcarrier k is centred at \(f = k \Delta f\) with first nulls at \(f = k\Delta f \pm \Delta f\) — precisely at the centres of adjacent subcarriers. This is the geometric reason for orthogonality: each subcarrier's sinc passes through zero at all other subcarrier frequencies.
Spectral Efficiency
The CP wastes \(N_{CP}/(N + N_{CP})\) of the time resource. The spectral efficiency (fraction of time used for useful data) is:
| Numerology μ | \(N_{CP}/N\) (normal CP) | \(\eta_{CP}\) |
|---|---|---|
| 0 | 144/2048 ≈ 7.0% | 93.4% |
| 1 | 144/2048 ≈ 7.0% | 93.4% |
| 2 | 144/4096 ≈ 3.5% (FR2) | 96.6% |
| 3 | 144/8192 ≈ 1.8% (FR2) | 98.3% |
2.8 Resource Grid — Time-Frequency Structure
The transmitted signal is organized into a resource grid in the time-frequency plane:
| Entity | Time dimension | Frequency dimension | Size |
|---|---|---|---|
| Resource Element (RE) | 1 OFDM symbol | 1 subcarrier | 1 complex QAM symbol |
| Resource Block (RB) | 14 OFDM symbols (1 slot) | 12 subcarriers | 168 RE = 168 QAM symbols |
| Slot (normal CP, \(\mu=0\)) | 0.5 ms (1 ms subframe / 2) | — | 14 OFDM symbols |
| Subframe | 1 ms | — | \(2^\mu\) slots |
| Radio Frame | 10 ms | — | 10 subframes = \(10 \cdot 2^\mu\) slots |
| Max RBs (FR1 100 MHz, \(\mu=1\)) | — | 273 RBs = 3276 subcarriers | 546 MHz? No: \(273 \times 12 \times 30\,\text{kHz} = 98.28\,\text{MHz}\) |
2.9 CP-OFDM vs Plain OFDM — Why CP is Essential
Without the CP, the OFDM signal would suffer from two simultaneous impairments in a multipath channel:
| Impairment | Without CP | With CP (length ≥ channel delay spread) |
|---|---|---|
| Inter-Symbol Interference (ISI) | Last samples of symbol n corrupt first samples of symbol n+1 | Delayed copies fall inside the CP; CP is discarded → no ISI |
| Inter-Carrier Interference (ICI) | Orthogonality broken: linear conv ≠ circular conv → mixing between subcarriers | Circular convolution property holds → \(R[k] = H[k]S[k]\) → no ICI |
| Equalization complexity | Requires N×N matrix inversion or MLSE with exponential complexity | Single complex divide per subcarrier: \(\hat{S}[k] = R[k]/H[k]\) |
| Channel estimation | Must estimate full N×N channel matrix | Estimate scalar \(H[k]\) at pilot subcarriers and interpolate |
After FFT at the receiver:
\(R[k] = H[k] \cdot S[k] + W[k]\), where \(W[k] \sim \mathcal{CN}(0, \sigma^2)\)
Zero-forcing (ZF) equalizer:
\(\hat{S}_{ZF}[k] = \dfrac{R[k]}{H[k]} = S[k] + \dfrac{W[k]}{H[k]}\)
MMSE equalizer:
\(\hat{S}_{MMSE}[k] = \dfrac{H^*[k]}{|H[k]|^2 + \sigma^2/\sigma_s^2} \cdot R[k]\)
Both are \(\mathcal{O}(1)\) per subcarrier — the CP is what makes this possible.
- Orthogonality under CFO: Suppose there is a carrier frequency offset (CFO) of \(\varepsilon \cdot \Delta f\) (i.e., \(\varepsilon\) is a normalized CFO in units of subcarrier spacing). Derive the ICI power from subcarrier m onto subcarrier n as a function of \(\varepsilon\) and \(|m-n|\). At what value of \(\varepsilon\) does the ICI become comparable to the signal power? (Hint: redo the orthogonality integral with the additional phase offset \(e^{j2\pi\varepsilon\Delta f t}\) and evaluate at small \(\varepsilon\).)
- CP design for a given channel: You are designing a 5G NR deployment for an industrial hall with a measured RMS delay spread of \(\tau_{rms} = 300\,\text{ns}\) and maximum excess delay \(\tau_{\max} = 1.5\,\mu\text{s}\). Which numerology (\(\mu\)) satisfies the CP constraint \(T_{CP} > \tau_{\max}\)? What is the resulting spectral efficiency \(\eta_{CP}\)? Would \(\mu=1\) be feasible here?
- PAPR analysis: The CP-OFDM signal \(s(t) = \sum_{k=0}^{N-1} S_k e^{j2\pi k\Delta f t}\) has high Peak-to-Average Power Ratio (PAPR). Show that when all \(N\) subcarriers are co-phased (worst case), the peak power is \(N^2 \sigma_s^2\) while the average power is \(N\sigma_s^2\), yielding a maximum PAPR = \(N\) (i.e., \(10\log_{10}N\) dB). For \(N=1024\), what is this PAPR in dB? Why is PAPR a critical design constraint for the power amplifier at the transmitter?
Cyclic Prefix & Multipath Robustness
ISI/ICI suppression · circular convolution · single-tap equalization · CP overhead in 5G NR
3.1 Multipath Propagation Model
In a wireless channel the transmitted signal arrives at the receiver via multiple paths — reflections, diffractions and scattering off buildings, terrain and vehicles. Each path adds a scaled, delayed copy of the signal. The baseband-equivalent received signal is modelled as a tapped-delay line:
where \(h_l \in \mathbb{C}\) is the complex gain of the \(l\)-th path, \(\tau_l\) is its propagation delay, \(L\) is the total number of resolvable paths, and \(n(t)\) is additive white Gaussian noise (AWGN). In the continuous-time domain the channel is fully described by its channel impulse response (CIR):
The key channel parameter for OFDM design is the maximum delay spread \(\tau_{\max} = \max_l \tau_l - \min_l \tau_l\). From it we derive the coherence bandwidth — the frequency range over which the channel response is approximately flat:
3.2 Inter-Symbol Interference (ISI) — Without CP
Consider two consecutive OFDM symbols, each of duration \(T_u = N/f_s\) seconds (where \(N\) is the FFT size and \(f_s\) is the sample rate). Without a guard interval, the \(m\)-th received sample of symbol \(i\) is:
If the channel length \(L > 1\) (i.e. \(\tau_{\max} > 0\)), delayed copies of the previous symbol \(s_{i-1}\) spill into the current symbol's observation window. This Inter-Symbol Interference destroys the orthogonality of the DFT basis and cannot be removed with a simple single-tap equalizer.
3.3 Inter-Carrier Interference (ICI) — Linear vs. Circular Convolution
Even if ISI is ignored, linear convolution in the time domain breaks subcarrier orthogonality. The DFT of a linear convolution is not the pointwise product of the individual DFTs:
As a result, subcarrier \(k\) receives energy from all other subcarriers \(k' \neq k\) — this is ICI. Each received DFT bin becomes:
where \(I_{kk'}\) are inter-carrier leakage coefficients whose magnitude depends on how far the received window is misaligned with the circular structure of the channel.
3.4 CP Insertion and Removal — Proof of Circularity
The cyclic prefix is formed by copying the last \(N_{CP}\) samples of the IFFT output and prepending them. The transmitted block of length \(N + N_{CP}\) is:
Transmitter: s[0..N-1] ← IFFT(S[0..N-1]) // N-point IFFT s_tx[0..N_CP-1] ← s[N-N_CP .. N-1] // copy last N_CP samples s_tx[N_CP..N+N_CP-1] ← s[0..N-1] // append payload transmit s_tx (length N + N_CP) Receiver: r_rx[0..N+N_CP-1] ← received block discard r_rx[0..N_CP-1] // remove CP r[0..N-1] ← r_rx[N_CP..N+N_CP-1] // N samples for DFT R[0..N-1] ← FFT(r[0..N-1])
Proof: CP Enforces Circular Convolution
Assume the channel has \(L\) taps: \(h[0], h[1], \ldots, h[L-1]\) and \(N_{CP} \geq L - 1\). After the CP is stripped, the \(n\)-th received sample is:
For \(n - l \geq 0\): \(\tilde{s}[n + N_{CP} - l] = s[n - l]\). For \(n - l < 0\): the sample falls in the CP region, but because \(N_{CP} \geq L - 1\), it equals \(s[N + n - l]\) — exactly what circular indexing \(s[(n-l) \bmod N]\) gives. Therefore:
Taking the N-point DFT of both sides, and using the circular-convolution theorem:
The channel is perfectly diagonalised: each subcarrier \(k\) sees only a single complex multiplication by \(H[k]\) — no ISI, no ICI.
3.5 Single-Tap Frequency-Domain Equalization
Because \(R[k] = H[k]\,S[k] + W[k]\), recovering \(S[k]\) requires only a scalar division (or multiplication by a weight) per subcarrier. Two common equalizer choices:
ZF perfectly cancels the channel but amplifies noise at deep fades (\(|H[k]| \approx 0\)). MMSE adds a noise-regularisation term \(\sigma_w^2/\sigma_s^2\) in the denominator, trading a small residual bias for reduced noise enhancement. MMSE reduces to ZF at high SNR.
3.6 CP Overhead and 5G NR Numerology
The CP occupies \(N_{CP}\) out of every \(N + N_{CP}\) transmitted samples. The CP overhead fraction is:
In 5G NR the FFT size and CP length scale with the subcarrier spacing \(\Delta f = 2^\mu \times 15\text{ kHz}\) (numerology index \(\mu\)). Normal CP values for the reference sample rate of 30.72 MHz (\(N = 2048\) at 15 kHz):
| \(\mu\) | \(\Delta f\) (kHz) | \(N_u\) (FFT) | \(N_{CP,\text{normal}}\) | CP duration (\(\mu\)s) | Overhead \(\eta_{CP}\) |
|---|---|---|---|---|---|
| 0 | 15 | 2048 | 144 (first: 160) | 4.69 (5.21) | ~7.0 % |
| 1 | 30 | 1024 | 72 (first: 80) | 2.34 (2.60) | ~7.0 % |
| 2 | 60 | 512 | 36 (first: 40) | 1.17 (1.30) | ~7.0 % |
| 3 | 120 | 256 | 18 (first: 20) | 0.59 (0.65) | ~7.0 % |
| 2 | 60 | 512 | 128 (Extended CP) | 4.17 | ~20.0 % |
3.7 CP Length Design: Matching Delay Spread to Numerology
The fundamental constraint is \(T_{CP} \geq \tau_{\max}\). Choosing a larger \(\Delta f\) (higher numerology) reduces \(T_{CP}\) and requires a lower \(\tau_{\max}\) — fine for indoor / mmWave deployments but problematic for rural macro cells.
| Deployment | Typical \(\tau_{\max}\) | Recommended \(\mu\) / SCS | Note |
|---|---|---|---|
| Rural macro LTE/5G | 3–5 µs | \(\mu=0\), 15 kHz | CP = 4.7 µs barely covers 5 µs; extended CP if needed |
| Urban macro | 1–3 µs | \(\mu=0\) or \(\mu=1\) | Normal CP adequate; 30 kHz viable |
| Dense urban / indoor | 0.1–0.5 µs | \(\mu=2\), 60 kHz | Short delay spread; lower latency with wider SCS |
| Indoor hotspot / FR2 mmWave | < 0.1 µs | \(\mu=3\), 120 kHz | Very short propagation distances; 0.59 µs CP is ample |
| High-speed rail (Doppler) | 0.5–2 µs | \(\mu=1\) or \(\mu=2\) | Doppler drives toward wider SCS; not delay spread |
3.8 OFDM Channel Model: Per-Subcarrier Fading
With CP in place the channel is fully described by \(H[k]\) — the channel frequency response sampled at each subcarrier. In a Rayleigh fading environment, each \(H[k] \sim \mathcal{CN}(0,1)\) independently when subcarrier spacing \(\Delta f \gg B_c^{-1}\). In practice, adjacent subcarriers are correlated over a coherence bandwidth of roughly \(B_c \approx 1/(5\tau_{\max})\):
where \(R_H(f)\) is the channel's frequency correlation function and \(\tau_{\text{rms}}\) is the RMS delay spread. This frequency correlation is exploited by:
- Channel estimation interpolation — pilots on every \(P\)-th subcarrier, interpolate over \(P\) subcarriers if \(P \cdot \Delta f \ll B_c\).
- Frequency-domain MIMO precoding — beamforming weights vary slowly across subcarriers.
- Link adaptation — CQI feedback per PRB (12 subcarriers) assumes flat fading within a PRB when \(12\Delta f \ll B_c\).
3.9 Guard Interval Alternatives: ZP-OFDM and Others
The CP is not the only guard interval strategy. Zero-Padding OFDM (ZP-OFDM) appends \(N_{ZP}\) zeros after the IFFT block instead of a cyclic prefix:
| Property | CP-OFDM | ZP-OFDM |
|---|---|---|
| Guard interval | Cyclic copy (N_CP samples) | Zero padding (N_ZP zeros) |
| TX power on guard | Full power (CP carries signal) | Zero — no TX power wasted |
| ISI suppression | Complete if \(N_{CP} \geq L-1\) | Complete if \(N_{ZP} \geq L-1\) |
| Circular convolution | Yes — exact DFT diagonalisation | No — requires overlap-add or special equalizer |
| Equalization complexity | \(O(N)\) scalars | \(O(N \cdot L)\) or iterative |
| Spectral efficiency | \(N/(N+N_{CP})\) | \(N/(N+N_{ZP})\) — same overhead |
| Used in | LTE, 5G NR, Wi-Fi (802.11a/g/n/ac/ax) | Some 802.11 modes, UWB, research |
- Q3.1 — CP length selection: A 5G NR deployment in a dense urban environment measures a maximum excess delay of \(\tau_{\max} = 2.1\,\mu\text{s}\) and an RMS delay spread of \(\tau_{\text{rms}} = 0.7\,\mu\text{s}\). (a) What is the minimum numerology \(\mu\) that guarantees no ISI with the normal CP? (b) If the operator uses \(\mu = 2\) (60 kHz SCS), what fraction of the CP is "wasted" (i.e., not needed to cover the delay spread)? (c) Calculate the coherence bandwidth and verify that a single 12-subcarrier PRB at \(\mu = 1\) is within it.
- Q3.2 — Circular convolution proof: Consider a 3-tap channel \(h = [0.8, 0.4j, -0.2]\) and an OFDM symbol with \(N = 8\) subcarriers and \(N_{CP} = 2\). The frequency-domain symbol is \(S[k] = e^{j\pi k/4}\) for \(k = 0,\ldots,7\). (a) Compute the IFFT output \(s[n]\), form the CP-padded transmission, apply the channel, strip the CP and take the FFT. (b) Verify that \(R[k] = H[k]\cdot S[k]\) where \(H[k]\) is the 8-point DFT of \(h\) zero-padded to length 8. (c) What would \(R[k]\) look like if \(N_{CP} = 1\) (too short)?
- Q3.3 — MMSE vs ZF tradeoff: In a frequency-selective channel, subcarrier \(k_0\) undergoes a deep fade: \(|H[k_0]|^2 = 0.01\) (−20 dB). The operating SNR is 20 dB (\(\sigma_s^2/\sigma_w^2 = 100\)). (a) Compute the post-equalization SNR for ZF and MMSE equalizers on subcarrier \(k_0\). (b) On a non-faded subcarrier with \(|H[k]|^2 = 1\), compare the two equalizers again. (c) Explain qualitatively why a modern 5G receiver uses MMSE-IRC rather than ZF, especially in interference-limited scenarios.
PAPR — The Peak-to-Average Power Ratio Problem
Why OFDM's greatest strength creates its most painful hardware constraint
4.1 PAPR Definition
The Peak-to-Average Power Ratio quantifies how much the instantaneous power of a transmitted signal can exceed its average power. For a continuous-time OFDM signal s(t):
An OFDM symbol with N subcarriers, each bearing unit-power:
Worst case: all N subcarriers align in phase at the same instant, so the instantaneous amplitude is N times the per-subcarrier amplitude:
| N (subcarriers) | PAPRmax (linear) | PAPRmax (dB) |
|---|---|---|
| 64 | 64 | 18.1 dB |
| 256 | 256 | 24.1 dB |
| 512 | 512 | 27.1 dB |
| 1024 | 1024 | 30.1 dB |
| 4096 | 4096 | 36.1 dB |
4.2 Statistical Analysis of PAPR
For large N, the Central Limit Theorem (CLT) applies: the sum of many independent random variables approaches a Gaussian distribution. The real and imaginary parts of s(t) each become approximately i.i.d. Gaussian, so the envelope |s(t)| follows a Rayleigh distribution.
The instantaneous power |s(t)|² follows an exponential distribution with mean Pavg. Treating the N sub-samples as independent (Nyquist rate), the Complementary CDF (CCDF) of PAPR is:
Operating PAPR at 0.1% outage (CCDF = 10−3): solving 1 − (1 − e−γ)N = 10−3:
| N | γ0.1% (linear) | γ0.1% (dB) |
|---|---|---|
| 64 | ≈ 2.23 | ≈ 3.5 dB |
| 256 | ≈ 3.61 | ≈ 5.6 dB |
| 512 | ≈ 4.30 | ≈ 6.3 dB |
| 1024 | ≈ 5.00 | ≈ 7.0 dB |
4.3 Why PAPR Matters — The HPA Problem
A High Power Amplifier (HPA) is characterised by a linear region up to its saturation point Psat, beyond which the output clips and severe nonlinear distortion is introduced. The transmit chain must operate with sufficient back-off to keep the signal within the linear region.
where OBO is the Output Back-Off. For reliable operation, OBO must equal (or exceed) the PAPR at the required outage level.
Quantifying Efficiency Loss
For a Class A amplifier, the theoretical drain efficiency is:
At saturation (OBO = 0 dB): ηA = 50%. With a 10 dB back-off:
Class B amplifiers have a more favourable back-off characteristic:
Peak efficiency ≈ 78.5% at saturation, dropping to ≈ 24.8% at 10 dB OBO — still painful, but significantly better than Class A.
4.4 PAPR Reduction Techniques
Six major categories of PAPR reduction exist, each with distinct trade-offs in complexity, spectral efficiency, and performance.
The simplest approach: hard-clip the signal when its envelope exceeds a threshold Aclip, then apply a bandpass filter to restore the out-of-band spectrum.
The Clipping Ratio (CR) is defined as:
Trade-off: Smaller CR → more PAPR reduction, but more clipping noise → higher BER and in-band distortion. Filtering removes out-of-band regrowth but may re-introduce small peaks. Iterative clipping and filtering (ICF) can approach CR = 1.4 with acceptable BER degradation (<1 dB at 10−3 BER for QPSK).
Generate U candidate OFDM symbols by multiplying the frequency-domain data vector X by U different phase rotation vectors P(u). Transmit the candidate with the lowest PAPR.
Side information: The receiver must be informed which rotation vector was used to recover the data. This costs ⌈log₂U⌉ bits per OFDM symbol. U = 4 reduces PAPR by ≈3 dB; U = 8 by ≈4 dB, but complexity grows linearly in U.
Partition the N subcarriers into V disjoint sub-blocks X(v). Apply phase factors b(v) ∈ {±1, ±j} to each sub-block, then sum:
Optimise over all combinations of b(v). With 4 phase values and V sub-blocks: 4V candidates (minus one degree of freedom = 4V−1 IFFT evaluations). Total IFFT count: V × 4V−1. V = 4 → 16 IFFTs; V = 8 → 8192 IFFTs — exponential complexity growth.
Reserve a small subset R of subcarriers (the "peak-reduction tones") that carry no data. Use these tones to construct a cancelling signal c(t) that reduces peaks in the time domain:
No side information needed — the receiver simply ignores the reserved tones. The iterative kernel algorithm converges in 5–10 iterations to 1–2 dB PAPR reduction. TR is specified in LTE Release 8 (PDSCH PAPR reduction) and DVB-T2. Typically |R| = 1–2% of total subcarriers.
Extend outer constellation points further outward (away from the decision boundary) to create destructive interference with peak-contributing subcarriers. Inner points are left unchanged to preserve minimum Euclidean distance.
Advantages: No rate loss, no side information, compatible with adaptive modulation. Typical PAPR reduction: 1.5–3 dB. Works best with QAM constellations of order 16 or higher (more outer points available for extension).
The most impactful PAPR reduction: use a DFT precoding stage before the IFFT to convert the signal into a single-carrier-like waveform while retaining OFDM's multi-carrier framework. Covered in detail in §4.5.
4.5 DFT-s-OFDM (SC-FDMA) — PAPR Advantage in Detail
Transmitter Architecture
The DFT-s-OFDM transmitter adds a single DFT precoding stage before the conventional OFDM IFFT:
| Stage | Operation | Output Size |
|---|---|---|
| 1. DFT Precoding | M-point DFT of M data symbols: X̃ = FM · d | M frequency bins |
| 2. Subcarrier Mapping | Map M DFT outputs onto M contiguous subcarriers of N-point grid (N ≫ M) | N subcarrier values |
| 3. N-point IFFT | Standard OFDM IFFT | N time samples |
| 4. CP insertion | Add cyclic prefix of length NCP | N + NCP samples |
The cascade of M-point DFT followed by the N-point IFFT is equivalent to transmitting M data symbols as a single-carrier signal occupying a bandwidth of M · Δf. Mathematically, the time-domain output is:
Substituting, s[n] reduces to a linearly-modulated single-carrier signal (with rectangular pulse-shaping), explaining the dramatically lower PAPR.
PAPR Numbers
| Parameter | CP-OFDM | DFT-s-OFDM |
|---|---|---|
| PAPR at 0.1% CCDF (QPSK) | ~10–11 dB | ~5–6 dB |
| PAPR at 0.1% CCDF (16QAM) | ~11–12 dB | ~6–7 dB |
| Reduction vs CP-OFDM | — | 4–6 dB |
Why 5G NR Uses Both Waveforms
| Direction | Waveform | Primary Reason |
|---|---|---|
| Downlink (gNB → UE) | CP-OFDM | Base station has abundant power; MIMO spatial multiplexing requires independent per-subcarrier precoding; complexity/power at gNB is manageable. |
| Uplink (UE → gNB) | DFT-s-OFDM (default) + CP-OFDM (optional) | UE battery and PA efficiency are critical; 4–6 dB PAPR reduction → ~3× better PA efficiency → longer battery life. UE uses simpler SISO or small MIMO. |
4.6 Waveform Comparison: CP-OFDM vs DFT-s-OFDM
| Property | CP-OFDM | DFT-s-OFDM (SC-FDMA) |
|---|---|---|
| PAPR (0.1% outage) | ~10–12 dB | ~5–7 dB (4–6 dB gain) |
| PA efficiency | Low (~5% Class A at 10 dB OBO) | Higher (~15% Class A at 5 dB OBO) |
| Spectral efficiency | High — independent per-subcarrier modulation | Slightly lower — DFT precoding adds constraint |
| Receiver complexity | One-tap FEQ per subcarrier (simple) | FEQ + IDFT at receiver (slightly higher) |
| MIMO spatial multiplexing | Fully supported — arbitrary rank | Limited to single-layer (rank-1) in standard use |
| Frequency diversity | Limited (localized scheduling) | Inherent via DFT spreading |
| Subcarrier allocation | Any subset (distributed or localized) | Must be contiguous (LFDMA/DFDMA special cases) |
| Standards (DL) | LTE, 5G NR, Wi-Fi, WiMAX, DVB-T2 | Not used in DL (no PA efficiency advantage) |
| Standards (UL) | 5G NR UL (optional), Wi-Fi | LTE UL (mandatory), 5G NR UL (default) |
| Sensitivity to frequency offset | High (ICI across all subcarriers) | High (same OFDM ICI mechanism) |
- Q1 — CCDF calculation: An OFDM system uses N = 512 subcarriers. Using the approximation Pr(PAPR > γ) ≈ 1 − (1 − e−γ)N, calculate the PAPR threshold γ (in dB) at a 0.01% outage probability (CCDF = 10−4). Then compare with N = 2048 (5G NR numerology μ=0, 20 MHz BW). How much additional back-off is required for the larger FFT? Hint: solve e−γ ≈ ln(N)/N at small outage values.
- Q2 — Efficiency comparison: A 5G NR base station transmits 40 W average output power. The HPA is Class B. Compare the DC power consumption when: (a) transmitting CP-OFDM DL at 10 dB OBO, and (b) transmitting DFT-s-OFDM UL at 5 dB OBO. Using Class B efficiency formula ηB = (π/4) · 10−OBO/20, calculate the wasted power in each case. What is the CO₂ implication if 10,000 base stations operate continuously?
- Q3 — SLM vs PTS trade-off: You are designing a PAPR reduction scheme for an N = 256 OFDM system targeting 3 dB PAPR reduction. Compare SLM with U = 4 versus PTS with V = 4 sub-blocks and 4 phase values. (a) How many IFFT operations does each method require per OFDM symbol? (b) How many bits of side information does SLM require per symbol? (c) Which method is preferable for a latency-constrained real-time system, and why might PTS be preferred for offline coding?
Frequency & Timing Synchronization
CFO, ICI, and Recovery — Why a 1% offset breaks OFDM orthogonality
5.1 Why Synchronization is Critical for OFDM
Single-carrier systems are relatively robust to small frequency and timing errors: a slight offset merely shifts the constellation slightly. OFDM is fundamentally different. Its entire performance rests on orthogonality between subcarriers — the property that every subcarrier integrates to zero over every other subcarrier's tone. That orthogonality is fragile.
A normalized CFO of just \(\varepsilon = 0.1\) (10% of subcarrier spacing) degrades SIR by roughly 10–12 dB. At \(\varepsilon = 0.2\) the system is effectively unusable without correction. This makes synchronization a first-order design requirement in any practical OFDM transceiver.
5.2 Carrier Frequency Offset (CFO) — Model and Effect
Sources of CFO
- Oscillator mismatch: TX and RX crystal oscillators run at nominally the same frequency but with small tolerances (ppm-level). At 28 GHz, even 1 ppm mismatch = 28 kHz offset — nearly two 15 kHz subcarriers.
- Doppler shift: Relative motion between TX and RX causes \(f_D = v \cdot f_c / c\). At vehicular speeds (100 km/h) and millimeter-wave, Doppler can be several kHz.
- Phase noise: Local oscillator phase noise acts as a time-varying instantaneous frequency offset, producing both common phase error and ICI.
Signal Model with CFO
Let the transmitted baseband OFDM symbol be \(s(t)\). After passing through a channel with carrier frequency offset \(\Delta f\), the received signal before the DFT is:
Normalizing to subcarrier spacing \(\Delta f_{\rm sc}\), define the normalized CFO:
With an \(N\)-point DFT and sample index \(n = 0, 1, \ldots, N-1\), the received time-domain samples are:
ICI Power Formula
After the DFT, the output on subcarrier \(k\) is:
where \(I_m = \frac{\sin(\pi(\varepsilon - m))}{N \sin(\pi(\varepsilon-m)/N)} e^{j\pi(\varepsilon-m)(N-1)/N}\) is the ICI coefficient from subcarrier \(l = k+m\). For small \(\varepsilon\), the total ICI power relative to signal power is:
The resulting Signal-to-ICI Ratio (SIR) due to CFO alone is:
5.3 Integer vs Fractional CFO
CFO can be decomposed as \(\varepsilon = \varepsilon_{\rm int} + \varepsilon_{\rm frac}\) where \(\varepsilon_{\rm int} \in \mathbb{Z}\) and \(|\varepsilon_{\rm frac}| \leq 0.5\).
Integer CFO (\(\varepsilon_{\rm int}\))
- Shifts all subcarriers by a whole number of positions in the DFT output.
- No ICI — orthogonality is preserved since the subcarrier grid merely shifts cyclically.
- Data is decoded from wrong subcarrier indices → symbol errors, but no spectral spreading.
- Can be detected via known pilots at fixed subcarrier positions (ambiguity from cyclic shift).
- Correction: apply a frequency shift of \(-\varepsilon_{\rm int} \cdot \Delta f_{\rm sc}\) before the DFT.
Fractional CFO (\(\varepsilon_{\rm frac}\))
- Breaks orthogonality — every subcarrier receives interference from all others.
- ICI proportional to \(\varepsilon_{\rm frac}^2\) as derived above.
- Even \(|\varepsilon_{\rm frac}| = 0.01\) causes \(\sim\)–35 dB SIR ceiling — problematic for high-order QAM.
- Must be corrected in time domain (before DFT) by multiplying by \(e^{-j 2\pi \hat{\varepsilon}_{\rm frac} n / N}\).
- Primary target of all fine frequency synchronization algorithms.
Chart: ICI Power vs Normalized CFO
The plot below shows the ICI-to-signal ratio (in dB) as a function of normalized CFO \(\varepsilon\). Both the exact expression (computed numerically from the DFT output formula) and the small-angle approximation \(P_{\rm ICI}/P_s \approx \pi^2\varepsilon^2/3\) are shown. The rapid degradation with even modest CFO is the defining challenge of OFDM synchronization.
Figure 5.1 — ICI-to-signal ratio vs normalized CFO. At \(\varepsilon=0.1\) the SIR floor is ~14.8 dB; at \(\varepsilon=0.3\) it collapses to ~5 dB, making 16-QAM or higher impractical without correction.
5.4 CFO Estimation Algorithms
Time-Domain: CP-Based Estimation
The cyclic prefix is a copy of the last \(N_{\rm CP}\) samples of the OFDM symbol. Without CFO, \(r[n] = r[n + N]\) for \(n\) in the CP window. With CFO, this relationship becomes:
The cross-correlation between the CP and its copy is:
The CFO estimate is extracted from the phase:
Frequency-Domain: Pilot-Based Estimation
If pilot subcarriers transmit known symbols \(P[k]\), the received pilot after DFT is:
where the CFO-induced phase on pilot \(k\) in symbol \(m\) is approximately \(\phi \approx 2\pi\varepsilon m\) (linear in symbol index). Comparing received to expected pilot phases across successive symbols:
Schmidl-Cox Algorithm
The Schmidl-Cox algorithm uses a special training symbol where the first half is identical to the second half in the time domain (achieved by transmitting non-zero data only on even subcarriers). Define the timing metric:
where \(R(d) = \sum_{m=0}^{N/2-1} |r[d+m+N/2]|^2\) is a normalization term. The peak of \(M(d)\) locates the symbol start, and the phase of \(P(d)\) at the peak gives the fractional CFO estimate:
5.5 Symbol Timing Offset (STO)
Even with perfect frequency synchronization, incorrect DFT window placement causes its own impairments. Define the symbol timing offset \(\delta = \delta_{\rm int} + \delta_{\rm frac}\) in samples.
Integer STO
An integer timing offset of \(\delta_{\rm int}\) samples corresponds to a cyclic shift of the DFT input. By the DFT shift theorem, this appears as a linear phase ramp across subcarriers:
This is a multiplicative phase that varies linearly with subcarrier index \(k\). It can be absorbed into the channel estimate during equalization — so integer STO within the CP window causes no ICI and no ISI, only a correctable phase ramp.
Fractional STO and Beyond-CP Offset
If the timing offset causes the DFT window to extend beyond the CP boundary (i.e., it captures part of the previous or next symbol), ISI and ICI result:
- ISI: energy from adjacent OFDM symbol enters the DFT window.
- ICI: the channel impulse response is no longer circular with respect to the DFT window, destroying the diagonal channel model in frequency domain.
Timing Estimation via CP Correlation
Exploit the CP structure: the correlation between the received signal and a delayed version of itself peaks when the delay equals the DFT length \(N\):
The index \(\hat{d} = \arg\max_d |\Lambda(d)|^2 / E(d)^2\) (where \(E(d)\) normalizes energy) gives the estimated symbol start position. The correlation magnitude forms a plateau of width \(N_{\rm CP} - L_{\rm ch} + 1\) — the safe timing window.
Chart: CP-Based Timing & Frequency Sync Correlation
The chart below simulates the normalized CP correlation metric \(M(d)\) as a function of timing offset from the true symbol start. The plateau region corresponds to the safe CP window; the peak marks the optimal DFT start. A multipath channel (4-tap, delay spread 4 samples) is included to show real-world behavior.
Figure 5.2 — Normalized CP correlation metric vs timing offset (samples). \(N=64\), \(N_{\rm CP}=16\), SNR = 10 dB, 4-tap Rayleigh channel with max delay 4 samples. The plateau region has width \(\approx N_{\rm CP} - L_{\rm ch} = 12\) samples. True symbol start is at offset = 0.
5.6 Phase Noise
Model
Local oscillator phase noise introduces a time-varying phase \(\phi(t)\) that modulates the received signal. In discrete time, the received sample is:
After the DFT, phase noise has two effects that can be separated via the Fourier expansion of \(e^{j\phi[n]}\):
Common Phase Error (CPE)
- DC component of phase noise: \(\Phi_0 = \frac{1}{N}\sum_n \phi[n]\).
- Rotates all subcarriers by the same angle \(\Phi_0\).
- Equivalent to a single complex multiplier on the entire symbol.
- Correctable using pilot symbols distributed across frequency: estimate \(\hat{\Phi}_0\) from pilot phase, then de-rotate.
- This is why 5G NR defines DMRS and PTRS for CPE tracking.
ICI from Phase Noise
- Non-DC components of phase noise cause ICI between subcarriers.
- For Wiener (random walk) phase noise with variance \(\sigma_\phi^2\) per sample:
- Increases with symbol duration \(T_u\) — larger OFDM symbols (smaller SCS) suffer more from phase noise.
- Particularly severe at mmWave (FR2): oscillator stability is worse, and 5G NR mmWave uses PTRS to track and correct residual ICI.
5.7 Doppler Effect and High-Mobility Scenarios
Relative motion between TX and RX at velocity \(v\) causes a Doppler frequency shift:
Worked Example: 28 GHz at 100 km/h
\(f_D = 27.78 \times 28 \times 10^9 / (3 \times 10^8) = 2593 \text{ Hz}\)
With 15 kHz SCS (\(\mu=0\)): \(\varepsilon = f_D / \Delta f_{\rm sc} = 2593/15000 = 0.173\)
\(\mathrm{SIR} \approx 3/(\pi^2 \times 0.173^2) \approx 10.1 \text{ dB}\) — severely limits 64-QAM or 256-QAM without correction.
With 60 kHz SCS (\(\mu=2\)): \(\varepsilon = 2593/60000 = 0.043\) → \(\mathrm{SIR} \approx 164 \approx 22.1 \text{ dB}\) — acceptable.
This is the fundamental reason 5G NR defines multiple numerologies (subcarrier spacings). Mobility determines the appropriate SCS:
| \(\mu\) | SCS (kHz) | Slot duration (ms) | Max Doppler @ 28 GHz, 120 km/h | Normalized CFO \(\varepsilon\) | SIR (dB) |
|---|---|---|---|---|---|
| 0 | 15 | 1.0 | 3111 Hz | 0.207 | 8.6 |
| 1 | 30 | 0.5 | 3111 Hz | 0.104 | 14.5 |
| 2 | 60 | 0.25 | 3111 Hz | 0.052 | 20.5 |
| 3 | 120 | 0.125 | 3111 Hz | 0.026 | 26.5 |
Table 5.1 — Normalized Doppler CFO and resulting SIR vs 5G NR numerology at 28 GHz, 120 km/h.
5.8 Synchronization in 5G NR
5G NR implements a hierarchical synchronization architecture. The UE acquires synchronization in stages, from coarse to fine, using dedicated reference signals.
Synchronization Signal Block (SSB)
The SSB consists of PSS + SSS + PBCH DMRS. It is transmitted periodically (default 20 ms period) and enables initial cell search.
- Zadoff-Chu (ZC) sequence of length 127 on subcarriers −63 to +63.
- Three possible ZC root indices: \(u \in \{25, 29, 34\}\) (indicating 3 physical layer cell IDs modulo 3).
- ZC sequences have ideal autocorrelation → correlation peak gives symbol timing.
- PSS detection gives: coarse timing + integer CFO (frequency hypothesis search) + \(N_{\rm ID}^{(2)} \in \{0,1,2\}\).
- m-sequence based sequence of length 127, also on 127 subcarriers.
- 336 possible sequences → encodes \(N_{\rm ID}^{(1)} \in \{0,\ldots,335\}\).
- Full Physical Cell ID: \(N_{\rm ID}^{\rm cell} = 3 N_{\rm ID}^{(1)} + N_{\rm ID}^{(2)}\), giving 1008 unique cell IDs.
- Also provides frame timing (half-frame detection).
- Embedded within PDSCH/PUSCH data regions at known time-frequency positions.
- Enables channel estimation at the receiver for equalization.
- Also corrects Common Phase Error (CPE) from residual CFO and phase noise within each slot.
- Density and pattern depend on channel conditions (Type A: front-loaded, Type B: additional).
- Used only at FR2 (mmWave, above 24.25 GHz) where phase noise is most severe.
- Sparse in frequency (1 or 2 subcarriers per PRB block) but dense in time (every or every-other symbol).
- Tracks residual CPE after DMRS correction, symbol by symbol.
- Particularly important for 256-QAM at FR2 where phase noise margin is very tight.
Study Questions
- A 5G NR system operates at 3.5 GHz with 30 kHz SCS (\(\mu=1\)). A UE moves at 200 km/h. (a) Calculate the maximum Doppler shift \(f_D\). (b) Compute the normalized CFO \(\varepsilon\). (c) Using the SIR approximation \(\mathrm{SIR} \approx 3/(\pi^2\varepsilon^2)\), find the maximum achievable SNR ceiling due to ICI. (d) Would this SIR be acceptable for 64-QAM (which requires approximately 22 dB SNR)?
- The CP-based CFO estimator computes \(\hat{\varepsilon} = \frac{1}{2\pi}\angle(\gamma)\) where \(\gamma = \sum_{n=0}^{N_{\rm CP}-1} r^*[n] r[n+N]\). (a) What is the unambiguous estimation range of this estimator in terms of normalized CFO? (b) If the true CFO is \(\varepsilon = 1.3\) subcarrier spacings, what would the CP estimator report? (c) Describe a two-stage approach (combining CP-based and pilot-based estimation) to handle CFOs larger than ±0.5 subcarrier spacings.
- Explain why 5G NR defines Phase Tracking RS (PTRS) for FR2 (mmWave) but not for FR1 (sub-6 GHz). Your answer should address: (a) how oscillator phase noise power spectral density scales with carrier frequency, (b) the difference between CPE correction via DMRS and per-symbol CPE tracking via PTRS, and (c) why larger subcarrier spacings (used at FR2 for Doppler robustness) actually help with phase noise even though the symbol duration \(T_u\) is shorter.
Channel Estimation & Equalization
Single-tap equalization · ZF & MMSE · Pilot grids · DFT smoothing · 5G NR DMRS
6.1 Channel Model for OFDM
After cyclic prefix (CP) removal and the N-point FFT, the multipath channel reduces to a set of independent, parallel scalar channels — one per subcarrier. This is the fundamental reason OFDM is so amenable to simple equalisation. If the channel impulse response (CIR) is h[l], \; l=0,\ldots,L-1 with L \le N_{CP}, the received sample on subcarrier k is:
where H[k] = \sum_{l=0}^{L-1} h[l]\,e^{-j2\pi kl/N} is the DFT of the CIR evaluated at bin k, X[k] is the transmitted QAM symbol, and N[k] \sim \mathcal{CN}(0,\sigma_n^2) is complex AWGN.
The single-tap equalizer produces the estimate:
The design of the equalizer coefficient W[k] trades off between residual channel distortion (if W[k]H[k] \ne 1) and noise amplification (large |W[k]| boosts W[k]\,N[k]). The two canonical solutions are the Zero-Forcing and MMSE equalizers.
6.2 Zero-Forcing (ZF) Equalizer
The Zero-Forcing equalizer forces the inter-symbol interference to zero by perfectly inverting the channel:
This gives \hat{X}_{\mathrm{ZF}}[k] = X[k] + N[k]/H[k], so the residual noise power on subcarrier k is:
The post-equalization SNR at each subcarrier is:
Achieves optimal performance only when the channel is flat (|H[k]|=\text{const}). In practice, diversity techniques (coding, frequency hopping) are needed to protect against nulls.
6.3 MMSE Equalizer
The Minimum Mean Square Error (MMSE) equalizer minimizes \mathbb{E}\left[|\hat{X}[k]-X[k]|^2\right]. Assuming X[k] \sim \mathcal{CN}(0,\sigma_s^2):
where \mathrm{SNR} = \sigma_s^2/\sigma_n^2 is the average signal-to-noise ratio. The post-equalization SINR is:
Limiting cases
- High SNR: 1/\mathrm{SNR} \to 0 ⇒ W_{\mathrm{MMSE}} \to H^*[k]/|H[k]|^2 = 1/H[k] = W_{\mathrm{ZF}}
- Low SNR / channel null: |H[k]|^2 \ll 1/\mathrm{SNR} ⇒ W_{\mathrm{MMSE}} \approx H^*[k] \cdot \mathrm{SNR} \to 0 (gracefully suppress noisy subcarrier)
- Very low SNR: MMSE → matched filter W_{\mathrm{MF}}[k] = H^*[k]/|H[k]|
Residual bias
MMSE is biased: it introduces a scaling error on X[k]. The estimate must be corrected:
Alternatively, the bias is absorbed into the soft demapper's LLR computation in modern receivers.
ZF vs MMSE Equalizer — Noise Enhancement at Channel Nulls
A frequency-selective channel with a deep fade near subcarrier 40. The ZF response |W_{\mathrm{ZF}}[k]| = 1/|H[k]| blows up at the null, while the MMSE response |W_{\mathrm{MMSE}}[k]| remains bounded by trading off noise against residual distortion.
6.4 Pilot-Based Channel Estimation
To apply the equalizer, the receiver must first estimate H[k] across all subcarriers. The standard approach inserts known pilot symbols at selected time-frequency positions and interpolates between them.
Pilot Insertion Patterns
Comb pilots
Pilots placed every N_f subcarriers, across all or many OFDM symbols. Suited for rapidly time-varying channels (high Doppler) because the channel can be tracked continuously in time. Used in LTE CRS and 5G NR DMRS Type 1.
- Pilot spacing in frequency: \Delta_f \le B_c (coherence BW)
- Overhead proportional to 1/N_f
Block pilots
Entire OFDM symbols are dedicated as pilot symbols. Suited for slow fading channels with long coherence time. Simpler interpolation in frequency; extrapolation required at symbol boundaries in time.
- Pilot symbol spacing in time: \Delta_t \le T_c (coherence time)
- One pilot symbol every N_t data symbols
Least-Squares Estimation at Pilot Positions
At pilot subcarrier index k_p, where the transmitted symbol X_p[k_p] is known:
The noise term on the estimate has variance \sigma_n^2/|X_p|^2. Using unit-power pilots (|X_p|^2=1) gives the minimum-variance unbiased LS estimator. Full-power pilots (boosted above data power) improve estimate quality at the cost of increased PAPR and interference to adjacent subcarriers.
Interpolation Methods
| Method | Complexity | MSE | Notes |
|---|---|---|---|
| Linear | Low | Moderate | Fast, suitable for slowly varying channels |
| Spline (cubic) | Medium | Low | Smooth estimate; can overshoot at edges |
| DFT-based | Medium | Low–Very Low | Leverages CIR sparsity; see §6.5 |
| LMMSE (Wiener) | High | Optimal | Requires channel statistics; see §6.8 |
2D Nyquist Pilot Density Requirements
To avoid aliasing in the estimated channel, the pilot grid must satisfy the 2D Nyquist sampling criterion:
where B_c \approx 1/(2\pi\tau_{\max}) is the coherence bandwidth and T_c \approx 1/(2\pi B_D) is the coherence time, \tau_{\max} is the RMS delay spread, and B_D is the Doppler spread. Equivalently in discrete terms:
6.5 DFT-Based Channel Estimation
The CIR has at most L taps, where L \ll N for typical channels (e.g., L=16 taps in a 2048-point FFT). This sparsity can be exploited to dramatically reduce estimation noise:
Obtain \hat{H}_{\mathrm{LS}}[k_p] for all N_p pilots across the OFDM symbol.
\hat{h}[l] = \mathrm{IDFT}\{\hat{H}_{\mathrm{LS}}[k]\}. The true CIR occupies taps l=0,\ldots,L-1; higher lags contain only noise.
Set \hat{h}[l] = 0 for l \ge L (rectangular window) or apply a smooth taper to avoid Gibbs ringing.
\hat{H}_{\mathrm{DFT}}[k] = \mathrm{DFT}\{\hat{h}_{\mathrm{trunc}}[l]\}.
The noise reduction factor is N_p/L. For N_p = 512 pilots and L = 16 taps, the noise power in the estimate is reduced by 15 dB. The MSE of the DFT estimator is:
6.6 DMRS in 5G NR
The Demodulation Reference Signal (DMRS) is the primary channel estimation pilot in 5G NR for PDSCH (downlink data) and PUSCH (uplink data). Its design reflects lessons from LTE and the demand for ultra-low-latency channel estimation.
DMRS Type 1
- Comb-2 pattern: pilots on alternate subcarriers within each RB
- Supports up to 4 orthogonal ports (OCC × frequency comb)
- Lower pilot density → lower overhead for single-layer transmissions
- Default for most NR deployments
DMRS Type 2
- Comb-3 pattern: pilots grouped in sets of 2 consecutive subcarriers
- Supports up to 6 orthogonal ports
- Better suited for massive MIMO with many spatial layers
- Higher pilot density in each port's allocation
Front-Loaded DMRS
NR places DMRS in the first 1 or 2 OFDM symbols of each slot (or mini-slot). This allows the UE to start channel estimation and data demodulation without waiting for the full slot — critical for low-latency URLLC. In LTE, CRS was spread across the entire subframe, incurring higher decoding latency.
Additional DMRS Positions (High Mobility)
For high-speed scenarios (vehicular, HST), the channel varies significantly within a slot.
NR supports additional DMRS positions (up to 4 per slot), densifying the pilot
grid in the time direction to track faster fading. This is configured via
maxLength and additionalPosition in the DMRS configuration IE.
DMRS Sequence Generation
NR DMRS symbols are generated from a length-31 Gold sequence (same pseudorandom generator used for scrambling), initialized per cell ID, slot, and symbol index. The complex symbols are:
where c(n) is the Gold sequence output. This BPSK-like construction gives |d(m)|=1 — constant modulus — which minimises PAPR and ensures consistent pilot power across all positions.
| Signal | Use | Direction | Pattern |
|---|---|---|---|
| DMRS | Data demodulation (per-layer CE) | DL+UL | Comb (Type 1/2) in time-first symbols |
| PTRS | Phase noise tracking | DL+UL | Sparse in frequency, dense in time |
| CSI-RS | CQI/PMI/RI feedback, beam management | DL | Configurable grid, up to 32 ports |
| SRS | UL channel sounding, reciprocity | UL | Comb in last symbols of slot |
| CRS (LTE) | DL CE, RRM measurements | DL | Comb, 4 ports, all subframes |
6.7 Time-Varying Channels & the Scattering Function
Real wireless channels are non-stationary: scatterers move, causing the channel to vary in both delay and Doppler. The Wide-Sense Stationary Uncorrelated Scattering (WSSUS) model characterises the channel by its scattering function S(\tau,\nu), the power spectral density in delay-Doppler space.
Delay spread & coherence bandwidth
\sigma_\tau = RMS delay spread. Subcarrier spacing \Delta f \ll B_c is needed to ensure flat fading per subcarrier.
Doppler spread & coherence time
f_D = maximum Doppler frequency, v = UE velocity, f_c = carrier frequency.
The 2D Nyquist pilot density requirements in discrete terms (subcarrier spacing \Delta f_{\mathrm{sc}}, OFDM symbol duration T_s = T_u + T_{CP}):
Violating these bounds causes aliasing in the estimated channel — the interpolated H[k] will contain errors that persist regardless of SNR, creating an estimation error floor.
6.8 LMMSE (Wiener) Channel Estimation
The Linear MMSE (also called Wiener filter) estimator exploits the statistical correlation of the channel across frequency and time — the channel does not change independently at each subcarrier; nearby subcarriers are correlated with correlation length \sim B_c/\Delta f_{\mathrm{sc}}.
2D Wiener Filter
Let \mathbf{y}_p = [Y[k_{p,1}],\ldots,Y[k_{p,N_p}]]^T be the received pilots and \mathbf{h}_p the channel at pilot positions. The LMMSE estimate of the full channel vector \mathbf{h} is:
where \mathbf{R}_{H H_p} is the cross-correlation matrix between all subcarriers and pilot subcarriers, and \mathbf{R}_{H_p H_p} is the autocorrelation at pilot positions. The SNR-scaled identity accounts for the noise in the LS estimate.
Complexity challenge
Full LMMSE requires inversion of an N_p \times N_p matrix — complexity \mathcal{O}(N_p^3). For N_p = 128, that is ~2M operations per OFDM symbol — too expensive for real-time receivers.
Separable 1D approximation
Approximate the 2D filter as two cascaded 1D Wiener filters (frequency domain, then time domain). Reduces complexity to \mathcal{O}(N_f^2 + N_t^2). Most 4G/5G baseband implementations use this approach with pre-computed filter taps.
The 1D frequency-domain Wiener filter coefficients at pilot spacing \Delta k are derived from the Clarke/Jakes power delay profile. For an exponential PDP with RMS delay spread \sigma_\tau:
Pilot Grid & Interpolated Channel Estimate
Left: time-frequency resource grid showing pilot positions (yellow dots) vs data subcarriers (blue background). Right: the interpolated channel magnitude |\hat{H}(t,f)| surface reconstructed from the pilot estimates — a smoothly varying response reflecting the channel's coherence in both dimensions.
6.9 Practical Channel Estimation in LTE / 5G NR
Real deployments layer multiple reference signals, each optimised for a specific estimation purpose:
LTE Downlink
- CRS (Cell-specific Reference Signals): Legacy DL pilots, present in every subframe regardless of data scheduling. Comb pattern with 6-subcarrier frequency separation and 7-symbol time separation per antenna port (4 ports: port 0–3). Used for channel estimation, RSRP/RSRQ measurement, and PDSCH demodulation in non-precoded modes.
- DMRS (UE-specific RS): Precoded pilot for PDSCH in transmission modes 7–9. Located in symbol 4 (normal CP) of the subframe. Same precoding as data → transparent to the UE's equalizer.
- CSI-RS: Introduced in LTE-A Release 10 for large-scale MIMO feedback. Configurable density, up to 8 ports.
5G NR — Full Reference Signal Suite
- DMRS (PDSCH/PUSCH): Primary CE pilot; front-loaded (symbols 2–3 of slot); Types 1/2 with 1 or 2 CDM groups; mapped to up to 12 orthogonal ports via OCC/CDM. Supports both single-symbol and double-symbol DMRS.
- PTRS (Phase Tracking RS): Extremely sparse in frequency (1 per PRG of 2/4 RBs), dense in time (every symbol). Corrects CPE (common phase error) from local oscillator phase noise — dominant impairment at mmWave (FR2).
- CSI-RS: Flexible configuration: 1–32 ports, various densities (3, 1, 1/2 RE per RB per port). Multiple uses: NZP-CSI-RS for CSI measurement (CQI/PMI/RI), TRS (tracking RS) for timing/frequency tracking, CSI-RS for beam management (L1-RSRP).
- SRS (Sounding RS): UL; wideband channel sounding for UL scheduling, antenna selection, and TDD DL/UL reciprocity-based precoding. Transmitted in last symbols of slot; configurable bandwidth, comb factor (2 or 4), and periodicity.
Study Questions
- A 5G NR PDSCH allocation has 52 RBs (624 subcarriers), SCS = 30 kHz, and the channel has RMS delay spread \sigma_\tau = 100\,\mathrm{ns} and UE velocity v = 120\,\mathrm{km/h} at f_c = 3.5\,\mathrm{GHz}. Calculate the maximum allowable DMRS subcarrier spacing \Delta f_{\mathrm{pilot}} (in subcarriers) and the maximum pilot symbol spacing \Delta t_{\mathrm{pilot}} (in OFDM symbols) to satisfy the 2D Nyquist criterion. How many pilot RE per slot are required, and what percentage overhead does this represent for a 14-symbol slot?
- A ZF equalizer is applied to a 5G NR channel where subcarrier k^* has a deep fade of |H[k^*]| = -20\,\mathrm{dB} (relative to the mean channel gain). If the operating SNR is 20 dB, what is the post-equalization SNR on subcarrier k^* for (a) ZF and (b) MMSE equalizers? By how many dB does MMSE outperform ZF on this subcarrier? What does this imply for the required coding rate when frequency-selective fading is present?
- In the DFT-based channel estimator (§6.5), explain the bias-variance trade-off when choosing the truncation window length L. If the true CIR has 10 significant taps but you set L = 8 (underestimate) vs L = 32 (overestimate), quantify the impact on (a) estimation MSE and (b) channel reconstruction bias, assuming AWGN with SNR = 15 dB, N_p = 256 pilots. What practical rule of thumb should guide the choice of L in a 5G NR receiver?
LTE (4G) OFDM — OFDMA, SC-FDMA, Resource Grid & Air Interface
Long Term Evolution (LTE), standardised in 3GPP Release 8 (2008) and refined through Releases 9–15, brought OFDM from Wi-Fi into wide-area cellular networks for the first time. The downlink uses OFDMA (Orthogonal Frequency Division Multiple Access) while the uplink uses SC-FDMA (Single-Carrier FDMA, also called DFT-s-OFDM) — a deliberate asymmetry driven by the need to conserve UE battery life. Understanding LTE's parameter choices is the essential prerequisite for 5G NR's flexible numerology (§8).
7.1 LTE OFDM Parameters
LTE fixes the subcarrier spacing at 15 kHz for all channel bandwidths. This was chosen to balance Doppler tolerance (wider spacing is better) against multipath tolerance (wider spacing shortens the useful symbol time, reducing CP efficiency). 15 kHz maps to a useful symbol duration of exactly \(T_u = 1/\Delta f = 66.7\;\mu\text{s}\).
| Parameter | 1.4 MHz | 3 MHz | 5 MHz | 10 MHz | 15 MHz | 20 MHz |
|---|---|---|---|---|---|---|
| Subcarrier spacing \(\Delta f\) | 15 kHz | 15 kHz | 15 kHz | 15 kHz | 15 kHz | 15 kHz |
| FFT size \(N_{FFT}\) | 128 | 256 | 512 | 1024 | 1536 | 2048 |
| Occupied subcarriers | 72 | 180 | 300 | 600 | 900 | 1200 |
| Resource Blocks (RBs) | 6 | 15 | 25 | 50 | 75 | 100 |
| Useful symbol duration \(T_u\) | 66.7 µs (= 1/15 kHz) — identical for all BW | |||||
| Normal CP length | 4.7 µs first symbol / 5.2 µs first symbol of slot · 144 or 160 samples @ \(f_s = 30.72\;\text{MHz}\) (20 MHz case) — ratio 144/2048 | |||||
| Extended CP length | 16.67 µs · 512 samples @ 30.72 MHz — 6 symbols/slot instead of 7 | |||||
| Slot duration | 0.5 ms — 7 symbols (normal CP) or 6 symbols (extended CP) | |||||
| Subframe duration | 1 ms = 2 slots (Transmission Time Interval, TTI) | |||||
| Frame duration | 10 ms = 10 subframes = 20 slots | |||||
| Sampling rate \(f_s\) (20 MHz) | 30.72 MHz = 2048 × 15 kHz — exact integer relationship between FFT size and sampling clock | |||||
7.2 LTE OFDMA — Downlink Architecture
In the LTE downlink, the eNodeB (base station) generates all subcarriers simultaneously via a single N-point IFFT. Multiple UEs are multiplexed by assigning each a non-overlapping set of Resource Blocks (RBs) within each 1 ms subframe — this is Orthogonal Frequency Division Multiple Access.
Resource Block Definition
Frequency-Selective Scheduling
At each TTI (1 ms), the eNodeB scheduler assigns RBs to UEs based on Channel Quality Indicator (CQI) feedback. A UE reports the CQI it measures on each subband (a group of RBs). The scheduler exploits multi-user diversity: different UEs experience fading peaks at different frequencies, so the aggregate system capacity is maximised by assigning each subband to whichever UE sees it best — the proportional-fair or maximum C/I criterion.
7.3 SC-FDMA (DFT-s-OFDM) — Uplink Architecture
Why Not OFDMA in the Uplink?
OFDMA's chief drawback is a high Peak-to-Average Power Ratio (PAPR) — typically 8–12 dB (see §4). A high PAPR forces the UE power amplifier to operate far below its saturation point (large backoff), drastically reducing power efficiency. For a battery-powered UE, this directly reduces talk time. The eNodeB is a mains-powered infrastructure node where PAPR is a manageable engineering problem; the UE is not.
DFT-s-OFDM Signal Processing Chain
Input bits
→ QAM mapper (QPSK / 16-QAM / 64-QAM) → M data symbols: d[0..M-1]
→ M-point DFT → D[k] = Σ_{m=0}^{M-1} d[m] · e^{-j2πmk/M}, k=0..M-1
→ Subcarrier mapping (M occupied out of N total, N > M)
LFDMA: map D[k] to contiguous subcarriers [k_0 .. k_0+M-1]
IFDMA: map D[k] to every (N/M)-th subcarrier (not used in LTE)
→ N-point IFFT → time-domain single-carrier-like waveform s[n]
→ Insert cyclic prefix (same structure as OFDM)
→ DAC + RF → air interface
The key insight: the DFT "pre-spreads" the M QAM symbols across M subcarriers so that the final IFFT output looks like a single-carrier signal. The time-domain signal has the envelope statistics of single-carrier (low PAPR) rather than a sum of many independent sinusoids (high PAPR).
M Must Be "Highly Composite"
The M-point DFT is implemented as an FFT. For computational efficiency, M must factor into small primes. LTE mandates that M (the number of assigned subcarriers, always a multiple of 12) must be expressible as \(M = 2^a \cdot 3^b \cdot 5^c\) with \(a,b,c \geq 0\). This is called a "good" or highly-composite number constraint. Valid values in LTE include 12, 24, 36, 48, 60, 72, 96, 120, 144, 180, … up to 1200. Any M that requires a prime factor > 5 is forbidden, which means RB allocations of 7, 11, 13 RBs (= 84, 132, 156 subcarriers) are not schedulable.
7.4 LTE Reference Signals
| Reference Signal | Full Name | Direction | Purpose | Density / Periodicity |
|---|---|---|---|---|
| CRS | Cell-specific RS | DL | Channel estimation (non-precoded), cell search, RSRP measurement, handover, legacy UE support | Every subframe; RE positions shift by cell ID mod 6. 4 REs per RB per slot for 2-port CRS (ports 0–1) |
| DMRS | Demodulation RS (UE-specific) | DL (Rel-10+) & UL | Channel estimation for precoded transmission; transparent to beamforming (precoded with data) | DL DMRS in symbols 4 and/or 7 of RB. UL DMRS in symbol 4 of each slot (Zadoff-Chu sequence) |
| CSI-RS | Channel State Information RS | DL | CQI/PMI/RI feedback; beamforming; interference measurement (ZP-CSI-RS nulls interferers) | 1, 2, 4, or 8 ports; configurable periodicity 5/10/20/40/80 ms; 1 RE per RB per port |
| SRS | Sounding Reference Signal | UL | Frequency-selective channel sounding; allows eNodeB to estimate UL channel and perform frequency-selective scheduling + link adaptation | Last symbol of subframe; configurable periodicity 2–320 ms; configurable bandwidth (wideband or partial-band) |
| PSS / SSS | Primary / Secondary Synchronisation Signal | DL | Cell search and synchronisation; PSS carries physical layer ID group (0–2); SSS carries cell ID within group (0–167) | PSS in subframe 0 and 5 (FDD); occupies central 62 subcarriers; Zadoff-Chu (PSS) and length-31 m-sequence (SSS) |
7.5 LTE MIMO and OFDM Interaction
OFDM converts a wideband frequency-selective MIMO channel into a bank of narrowband flat-fading MIMO subchannels — one per subcarrier. This is the most elegant property of OFDM for MIMO: the spatial processing (precoding, detection) can be performed independently per subcarrier.
| MIMO Mode | LTE Transmission Mode (TM) | Max Layers | Precoding | Feedback Required |
|---|---|---|---|---|
| Single Antenna | TM1 | 1 | None | CQI |
| Transmit Diversity (SFBC) | TM2 | 1 | SFBC (Space-Frequency Block Coding) | CQI |
| Open-Loop Spatial Mux | TM3 | 4 | Cyclic Delay Diversity + codebook | CQI, RI |
| Closed-Loop Spatial Mux | TM4 | 4 DL / 2 UL | Codebook PMI (3GPP 36.213 tables) | CQI, PMI, RI |
| MU-MIMO | TM5 | 1 (per UE) | Single-layer PMI, orthogonal UE pairing | CQI, PMI |
| Beamforming (TDD) | TM7–TM9 | 8 (TM9) | UE-specific DMRS — eNodeB estimates UL channel for DL precoder (TDD reciprocity) | CQI, PMI, RI (subband) |
Space-Frequency Block Coding (SFBC) in LTE TM2 encodes pairs of symbols across two antenna ports and two adjacent subcarriers, achieving 2nd-order diversity without CSI at the transmitter. It is the LTE equivalent of Alamouti coding but mapped to the frequency dimension (rather than time) to avoid the time variation between symbols.
7.6 LTE Capacity — Shannon Analysis
For LTE 20 MHz with 2×2 MIMO (2 independent spatial streams):
Practical LTE Peak vs Theoretical
| Overhead Source | Approximate Loss | Explanation |
|---|---|---|
| Cyclic Prefix (Normal) | ~7% capacity loss | CP = 144/2048 ≈ 6.7% of symbol duration is guard; carries no data |
| CRS Pilot Overhead (2 ports) | ~4.8% | 4 REs per RB-pair occupied by CRS; 4/168 × 2 ports in 2×2 system |
| PDCCH / Control Region | ~14–21% | First 1–3 OFDM symbols per subframe reserved for DL control (PDCCH, PCFICH, PHICH) |
| Guard Subcarriers | ~10% | 1200 occupied out of 1536 (for 15 MHz; 1200 out of ~1365 for 20 MHz) — spectral shaping |
| MCS Granularity & BLER Target | ~10–15% | Finite MCS set forces operating below Shannon bound; 10% BLER target adds ~1 dB margin |
| Practical Peak DL (20 MHz 2×2) | ~75 Mbps | 3GPP baseline; operator measurements typically 60–80 Mbps in good conditions |
7.7 LTE vs 5G NR — Comparison Preview
Full 5G NR treatment follows in §8. This teaser table highlights the key differences.
| Parameter | LTE (4G) | 5G NR (FR1 + FR2) |
|---|---|---|
| Subcarrier Spacing | Fixed 15 kHz | Flexible: 15 / 30 / 60 / 120 / 240 kHz (numerology µ = 0–4) |
| DL Waveform | CP-OFDM (OFDMA) | CP-OFDM (OFDMA only) |
| UL Waveform | DFT-s-OFDM (SC-FDMA) only | CP-OFDM or DFT-s-OFDM (UE-configurable) |
| Max BW (single CC) | 20 MHz | 100 MHz (FR1 sub-6 GHz), 400 MHz (FR2 mmWave) |
| Max MIMO Ports (DL) | 4 ports (TM9: 8 CSI-RS ports) | 32 ports (FR1 massive MIMO), 256 in O-RAN AAU deployments |
| Reference Signals | Always-on CRS (overhead ~5–10%) | No always-on CRS; DMRS + on-demand CSI-RS (lean design) |
| TTI | Fixed 1 ms subframe | Mini-slot (2–7 symbols) to full 14-symbol slot; scalable latency |
| Peak DL Rate | ~75 Mbps (20 MHz 2×2) | >1 Gbps (100 MHz 4×4, 256-QAM) |
| Carrier Aggregation | Up to 5 CC (100 MHz) | Up to 16 CC (can span FR1+FR2) |
7.8 Interactive Charts
- CP Efficiency Trade-off: LTE normal CP is 4.7 µs (6.7% overhead) while extended CP is 16.7 µs (20% overhead). Given that LTE uses 15 kHz subcarrier spacing with \(T_u = 66.7\;\mu\text{s}\), calculate the maximum excess multipath delay (in meters of path-length difference) that each CP can protect against. Under what real-world propagation scenario — urban canyon, suburban LOS, rural mountainous, or maritime — would you expect to need extended CP, and why?
- SC-FDMA PAPR vs Capacity: A UE is allocated M = 60 subcarriers (5 RBs) for uplink transmission using SC-FDMA (LFDMA). The total FFT size is N = 2048 (20 MHz). (a) Show that 60 is a "good" number (express as \(2^a \cdot 3^b \cdot 5^c\)). (b) If the UE's power amplifier has a 1 dB compression point of 23 dBm and the required power backoff for OFDMA is 8 dB but only 3 dB for SC-FDMA, what is the maximum output power the UE can deliver for each waveform, and what is the coverage-area ratio (assume path loss \(\propto d^{3.5}\))?
- MIMO Overhead and Practical Capacity: A 4×4 LTE system (TM9) uses 8-port CSI-RS. At 20 MHz with 4 spatial layers, the theoretical peak (Shannon, 4 layers × 20 MHz × log₂(1 + 20 dB SINR)) is approximately 532 Mbps. (a) Accounting for CP overhead (6.7%), PDCCH control region (2 symbols out of 14), CRS overhead for 4 antenna ports (~9.5%), and CSI-RS overhead for 8 ports (1 RE per port per RB, once every 5 ms), estimate the net spectral efficiency per layer. (b) Compare this to the LTE Category 6 UE peak rate of 300 Mbps specified in 3GPP TS 36.306. What additional practical factors account for the remaining gap?
8.1 Numerology Framework: Δf = 2μ × 15 kHz
3GPP Release 15 introduced flexible numerology as the foundation of 5G NR air interface design. Unlike LTE's single fixed 15 kHz subcarrier spacing, NR defines five numerologies indexed by μ ∈ {0,1,2,3,4}, each doubling the subcarrier spacing and halving the OFDM symbol duration relative to the previous one. This single design parameter simultaneously controls SCS, slot duration, cyclic prefix length, and FFT size — making the entire air interface scalable from coverage- optimised macro cells at 700 MHz up to low-latency mmWave links at 39 GHz.
| μ | SCS Δf | T_u (useful) | N-CP (normal) [samples @30.72 MHz·2μ] | T_CP normal | T_CP extended | Slot dur. | Slots/subframe | Slots/frame | Typical FFT size @common Fs | Use case |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 15 kHz | 66.67 μs | 144 / 160 (1st) | 4.69 μs | — | 1.0 ms | 1 | 10 | 2048 @ 30.72 MHz | LTE-compatible; FR1 macro; coverage |
| 1 | 30 kHz | 33.33 μs | 144 / 160 (1st) | 2.34 μs | — | 0.5 ms | 2 | 20 | 2048 @ 61.44 MHz | Most common FR1 sub-6 GHz; data |
| 2 | 60 kHz | 16.67 μs | 144 / 160 (1st) | 1.17 μs | 4.17 μs (ext.) | 0.25 ms | 4 | 40 | 2048 @ 122.88 MHz | FR1 & FR2; low-latency; unlicensed |
| 3 | 120 kHz | 8.33 μs | 144 / 160 (1st) | 0.586 μs | — | 0.125 ms | 8 | 80 | 2048 @ 245.76 MHz | FR2 mmWave; data; <100 m range |
| 4 | 240 kHz | 4.17 μs | 144 / 160 (1st) | 0.293 μs | — | 0.0625 ms | 16 | 160 | 2048 @ 491.52 MHz | FR2 reference signals (SSB) only |
Normal CP sample counts: For μ=0 at Fs=30.72 MHz: CP = 144 samples (≈4.69 μs) for symbols 1-6 and 8-13; CP = 160 samples (≈5.21 μs) for symbols 0 and 7 (first symbol of each half-slot gets a slightly longer CP to fill the 1 ms subframe boundary exactly). Extended CP (μ=2) uses CP = 512 samples at 245.76 MHz → 4.17 μs, for special downlink subframes in unpaired spectrum only.
8.2 Why Flexible Numerology? — The Delay-Doppler Trade-off
The two fundamental channel impairments that numerology must balance are delay spread (multipath) and Doppler spread (mobility). They pull the design in opposite directions.
Multipath echoes arrive over a delay spread τmax. The CP must exceed τmax to prevent ISI. Since T_CP is proportional to T_u (fixed ratio ~7%), using lower SCS → longer T_u → longer T_CP in absolute time → more multipath protection.
Urban macro: τmax ≈ 1–5 μs → SCS ≤ 30 kHz preferred.
Indoor: τmax ≈ 100 ns → SCS up to 240 kHz viable.
Mobile UEs create Doppler shift fD = v·f_c/c. The channel must remain quasi-static over one OFDM symbol for flat-fading ICI-free reception. Higher SCS → shorter symbol → less time for channel to decorrelate.
120 km/h @28 GHz: fD = 3.1 kHz → SCS ≥ 30 kHz needed (fD/Δf < 0.1).
500 km/h (HSR) @3.5 GHz: fD = 1.62 kHz → SCS ≥ 15 kHz.
The product fD · τmax is a channel-specific constant; when it approaches 1, no single numerology satisfies both constraints simultaneously — requiring techniques like OTFS or channel estimation with ICI mitigation. In practice, 5G NR allows different BWPs on the same carrier to use different numerologies, and mixed-numerology operation within a carrier is possible (with guard bands to limit ICI between BWPs).
| Scenario | f_c | τ_max (typ.) | f_D (typ.) | Recommended μ | Rationale |
|---|---|---|---|---|---|
| Rural macro, coverage layer | 700 MHz | 5–10 μs | <100 Hz | 0 (15 kHz) | Long CP; low Doppler; LTE coexistence |
| Urban macro, sub-3 GHz | 2.1 GHz | 1–3 μs | 200–500 Hz | 1 (30 kHz) | CP ≥ 2.34 μs clears multipath; most common |
| Urban sub-7 GHz / dense | 3.5 GHz | 0.5–1 μs | 300–800 Hz | 1–2 (30/60 kHz) | Low latency; short CP OK for small cells |
| mmWave indoor/outdoor | 28 GHz | 50–200 ns | 1–3 kHz | 3 (120 kHz) | Very short τ_max; high f_D; 0.586 μs CP sufficient |
| mmWave dense indoor | 39 GHz | 10–50 ns | 500 Hz | 3–4 (120/240 kHz) | Tiny delay spread; μ=4 for beam mgmt RS only |
| High-speed rail | 3.5 GHz | 0.5–2 μs | 1–2 kHz | 1–2 (30/60 kHz) | Doppler dominates; shorter symbol needed |
8.2-A Interactive Chart: Numerology Metrics Comparison (μ=0..4)
8.3 5G NR Resource Grid and Scheduling
The 5G NR physical resource grid preserves the LTE concept of Resource Blocks (12 subcarriers each) but extends it with flexible time-domain scheduling.
One subcarrier × one OFDM symbol. Carries one complex modulation symbol (QPSK/16QAM/64QAM/256QAM/1024QAM).
12 subcarriers × 1 slot (14 symbols). Same as LTE. Minimum scheduling granularity is 1 RB in frequency.
14 OFDM symbols. Duration = 1 ms / 2μ. Two half-slots of 7 symbols. Normal scheduling granularity.
2, 4, or 7 symbols for ultra-low latency (URLLC). Can start at any symbol boundary. Pre-emption indicator signals release of resources.
| BW | μ=0 (15 kHz) | μ=1 (30 kHz) | μ=2 (60 kHz) |
|---|---|---|---|
| 5 MHz | 25 RB | 11 RB | — |
| 10 MHz | 52 RB | 24 RB | 11 RB |
| 20 MHz | 106 RB | 51 RB | 24 RB |
| 50 MHz | 270 RB | 133 RB | 66 RB |
| 100 MHz | — | 273 RB | 135 RB |
8.4 CP-OFDM in 5G NR — UL MIMO Enabled
A key departure from LTE is that 5G NR mandates CP-OFDM for both downlink and uplink, whereas LTE restricted the UL to SC-FDMA (DFT-spread OFDM). This change has profound implications for uplink MIMO.
- Single-carrier DFT precoding forces frequency-contiguous resource allocation
- PAPR ≈ 7–8 dB (vs 10–12 dB for CP-OFDM)
- No UL spatial multiplexing (MIMO layers) — single antenna transmission per UE
- Power amplifier efficiency advantage for coverage-limited UEs
- Non-contiguous resource allocation possible (frequency hopping, multi-cluster)
- Up to 4 UL MIMO layers per UE (Category 3 UE)
- Consistent waveform with DL → simplified channel estimation and precoding
- Massive MIMO: gNB can transmit up to 32 DL layers (multi-user MIMO)
- DFT-s-OFDM still available optionally for coverage (see §8.5)
8.5 DFT-s-OFDM — Coverage Maximisation via PAPR Reduction
DFT-spread OFDM is an optional UL waveform in 5G NR, retained from LTE's SC-FDMA for scenarios where UE transmit power is the bottleneck. A UE reports its DFT-s-OFDM capability; the gNB configures it via higher-layer signalling when the UE is coverage-limited (e.g., cell edge, deep indoor).
| Waveform | Modulation | PAPR (99.9% CCDF) | Notes |
|---|---|---|---|
| CP-OFDM | QPSK | ~10.5 dB | High peak due to multicarrier summation |
| CP-OFDM | 16QAM | ~11 dB | Slightly higher due to non-constant envelope |
| CP-OFDM | 256QAM | ~12 dB | Even higher amplitude variation |
| DFT-s-OFDM | QPSK | ~6 dB | Significant PAPR reduction |
| DFT-s-OFDM | π/2-BPSK | ~3 dB | Near-optimal; best coverage waveform |
| DFT-s-OFDM | π/2-BPSK + spectral shaping | ~1–2 dB | 3GPP Rel-16 enhancement; used in RedCap |
| Feature | DFT-s-OFDM | CP-OFDM |
|---|---|---|
| PAPR | 3–7 dB (waveform dep.) | 10–12 dB |
| Coverage | Better (cell edge) | Standard |
| UL MIMO layers | 1 only (rank-1) | Up to 4 |
| Resource allocation | Contiguous RBs only | Any allocation |
| Frequency hopping | Limited | Flexible |
| Spectral efficiency | Lower at high SNR | Higher at high SNR |
| UE capability req. | Optional (reported) | Mandatory |
| Usage | Coverage-limited UL | Default UL and all DL |
8.6 Frequency Ranges: FR1 (Sub-7 GHz) vs FR2 (mmWave)
| Parameter | FR1 | FR2 |
|---|---|---|
| Frequency range | 450 MHz – 7.125 GHz | 24.25 GHz – 52.6 GHz |
| Common designation | Sub-6 GHz / Sub-7 GHz | mmWave |
| Maximum channel BW | 100 MHz | 400 MHz |
| Maximum aggregated BW | 1 GHz (CA) | 2 GHz (CA) |
| Supported SCS | 15, 30, 60 kHz | 60, 120, 240 kHz |
| SCS for SSB | 15, 30 kHz | 120, 240 kHz |
| Max SSB beams/half-frame | 8 (L_max=8, >3 GHz) | 64 (L_max=64) |
| Path loss exponent (NLOS) | 3.5–4.5 (PL ∝ f²·d³·⁵) | 4–6 (heavy shadowing) |
| Free space path loss @1 km | ~105 dB @3.5 GHz | ~131 dB @28 GHz |
| Typical cell radius | 100 m – 5 km | 10 – 200 m |
| UE antenna config | 2–4 antennas (pattern diversity) | Up to 8 panels (phased arrays) |
| gNB antenna config | Up to 256 TRX (Massive MIMO) | Up to 1024 elements (beamforming) |
| Duplex mode | FDD or TDD | TDD only |
| Phase noise (PN) | Manageable, minimal PTRS | Severe (PN ∝ f²); PTRS mandatory |
| Oxygen absorption | Negligible | High @60 GHz (15 dB/km) |
| Rain attenuation | Negligible below 10 GHz | 10–20 dB/km @28 GHz heavy rain |
| Penetration loss (glass) | 1–5 dB | 30–50 dB (mmWave reflects off glass) |
| Key challenge | Interference management, spectral efficiency | Beam management, phase noise, blockage |
8.7 Phase Tracking Reference Signal (PTRS) — Combating LO Phase Noise
At mmWave frequencies, local oscillator (LO) phase noise becomes a dominant impairment. Unlike frequency-selective fading (corrected per-subcarrier by channel equalisation), phase noise manifests as Common Phase Error (CPE) — a phase rotation that is identical across all subcarriers within one OFDM symbol — plus Inter-Carrier Interference (ICI) from the tails of the phase noise spectrum.
| Parameter | DL (PDSCH) | UL (PUSCH) |
|---|---|---|
| Time density (symbols) | Every 1 or 2 symbols (depends on SCS and MCS) | Same as DL |
| Frequency density | 1 port per 2, 4, or 16 RBs (ptrsFrequencyDensity) | Same; adapts to scheduled BW |
| Modulation threshold (time density) | MCS ≥ 29 → every symbol; MCS 10–28 → every 2 symbols | Same thresholds |
| BW threshold (freq density) | BW > N_RB0: 2 RB; > N_RB1: 4 RB; else: 16 RB | Same |
| PTRS port association | Associated with DMRS antenna port (not standalone) | Same |
| Required at low MCS? | No (disabled for QPSK/low 16QAM) | No |
| Phase noise benefit | CPE estimation + correction per symbol | CPE correction for UL transmission |
8.8 Bandwidth Parts (BWP) — Adaptive UE Bandwidth Operation
A Bandwidth Part (BWP) is a contiguous set of PRBs on a carrier, characterised by a numerology (SCS + CP type) and a physical resource block offset. Each UE has up to 4 configured BWPs per carrier direction (DL/UL), with exactly one active BWP at any time. BWPs enable UEs to operate on a bandwidth narrower than the full carrier, reducing RF processing and power consumption during low-activity periods.
Used during random access (RACH) and initial system information reception. Same SCS as SSB or CORESET#0. Width = CORESET#0 bandwidth. UE does not need to receive full carrier during RA.
Fallback BWP when inactivity timer (bwp-InactivityTimer) expires. Can be narrower than data BWP to save power. Configured in RRCReconfiguration. If not configured, BWP#0 is default.
Current operational BWP. Switched via DCI field (2-bit BWP indicator in DCI 1_1/0_1) or RRC reconfiguration. UE only needs to receive signals within active BWP bandwidth.
Secondary cell (SCell) can be configured with a dormant BWP — UE measures channel on narrow BWP, reducing power. Activation triggered by MAC-CE or DCI. Faster than full SCell activation.
| Trigger | Mechanism | Switching Latency | Use Case |
|---|---|---|---|
| DCI-based (downlink) | DCI 1_1 BWP indicator field (2 bits → BWP ID 0-3) | 3–4 slots | Dynamic bandwidth adaptation, service change |
| DCI-based (uplink) | DCI 0_1 BWP indicator field | 3–4 slots | UL rate adaptation |
| Inactivity timer | bwp-InactivityTimer (1–2560 ms) | Timer expiry | Power saving: switch to narrow default BWP after silence |
| RRC reconfiguration | RRCReconfiguration message | ~10 ms (RRC delay) | Service reconfiguration, handover |
| MAC-CE activation | MAC control element (SCell) | 2–3 slots | Dormant BWP activation on SCells |
| Random access | Automatic switch to initial BWP | Immediate | RACH procedure, beam failure recovery |
8.9 5G NR vs LTE — Comprehensive Air Interface Comparison
| Feature | LTE (Rel-8) | LTE-A (Rel-12) | 5G NR (Rel-15) | 5G NR (Rel-17+) |
|---|---|---|---|---|
| SCS options | 15 kHz only | 15 kHz only | 15/30/60/120/240 kHz | Same + 480/960 kHz (NTN) |
| Max channel BW | 20 MHz | 100 MHz (CA) | 100 MHz (FR1) / 400 MHz (FR2) | Same + 1.6 GHz (mmWave) |
| DL waveform | CP-OFDM | CP-OFDM | CP-OFDM | CP-OFDM |
| UL waveform | SC-FDMA (DFT-s-OFDM) only | SC-FDMA only | CP-OFDM (primary) + DFT-s-OFDM | Same |
| DL MIMO layers | 4 (Rel-8) | 8 (Rel-10) | 8 per UE (up to 32 MU-MIMO) | Same (Type-II CSI) |
| UL MIMO layers | 1 (no UL MIMO) | 4 (Rel-10) | 4 (CP-OFDM) / 1 (DFT-s) | Same |
| Slot duration | 1 ms (fixed) | 1 ms (fixed) | 0.5 ms / 2μ | Same |
| Mini-slot | No (sub-frame only) | No | 2/4/7 symbols | Same |
| Latency (user-plane) | ~4–8 ms | ~3–6 ms | <1 ms (URLLC) | <0.5 ms (XR/IIoT) |
| CP overhead/slot | ~6.7% (144/2048) | Same | ~10.4% (14 symbols incl. long CP) | Same |
| Reference signals | CRS (always-on, 4-port) | CSI-RS added | DMRS (on-demand), CSI-RS, SSB, PTRS, TRS | Same + PRS (positioning) |
| Always-on RS | CRS (all RBs all time) | CRS + CSI-RS | SSB only (periodic, configurable) | Same |
| Carrier aggregation | No (Rel-8) | Up to 5 CC | Up to 16 CC | Up to 16 CC |
| Frequency range | 700 MHz – 3.5 GHz | Up to 5.9 GHz | 450 MHz – 52.6 GHz | Same + 71 GHz (Rel-17) |
| Duplex flexibility | FDD & TDD (separate bands) | Same | FDD, TDD, SDL, SUL, dynamic TDD | Same + FDR (full duplex research) |
| Numerology flexibility | None | None | Full (per BWP, per carrier) | Same |
| Phase noise compensation | Not needed | Not needed | PTRS (FR2) | Enhanced PTRS |
| Beam management | Not supported | Limited (CoMP) | Full (SSB/CSI-RS beam mgmt) | BFR enhancements |
8.9-A Interactive Chart: FR1 vs FR2 — Coherence Bandwidth & Coherence Time
Study Questions — §8
- Numerology trade-off analysis: A vehicular UE operates at 120 km/h on a 28 GHz mmWave link (τ_max ≈ 100 ns, carrier freq = 28 GHz). Calculate (a) the Doppler spread f_D, (b) the coherence time T_c ≈ 0.423/f_D, and (c) which NR numerology μ best satisfies both f_D / Δf < 0.1 and T_CP > τ_max. Then explain why the same scenario at 700 MHz would use a different μ despite identical vehicle speed.
- DFT-s-OFDM coverage calculation: A cell-edge UE at 2.1 GHz is 8 km from the gNB. Using CP-OFDM QPSK (PAPR ≈ 10.5 dB) the UE cannot close the link budget. Switching to DFT-s-OFDM with π/2-BPSK (PAPR ≈ 3 dB) effectively increases transmit power. Assuming path loss PL ∝ d3.5 and the UE's PA is running 7.5 dB below saturation with CP-OFDM, calculate the new maximum range after switching to DFT-s-OFDM with π/2-BPSK. Explain why DFT-s-OFDM is restricted to rank-1 UL MIMO.
- BWP power saving & switching latency: A gNB configures a UE with two BWPs: BWP#1 (100 MHz, μ=1, active during data burst) and BWP#2 (10 MHz, μ=0, default dormant BWP). The bwp-InactivityTimer is set to 10 ms. (a) After a 5 Mbps data burst of 500 ms ends, what event triggers BWP switching and when does it occur? (b) A new DCI arrives 2 ms after the timer expires — what is the total interruption time? (c) How does mixed-numerology operation between BWP#1 and a neighbouring UE's BWP (both on the same carrier) generate ICI, and what guard band is required between them?
OFDM Variants & Alternatives — FBMC, f-OFDM, UFMC, OTFS
9.1 CP-OFDM Limitations That Motivate Alternatives
CP-OFDM has dominated 4G and 5G NR because of its elegant frequency-domain equalization (one complex multiply per subcarrier), but four structural weaknesses motivate the search for alternatives.
9.1.1 Cyclic-Prefix Overhead
The CP of length \(N_{CP}\) is a copy of the last \(N_{CP}\) samples of each OFDM symbol of length \(N\). The spectral efficiency penalty is:
In 5G NR numerology \(\mu=0\) (SCS = 15 kHz) the normal CP is 144 samples over a 2048-sample FFT, giving \(\eta = 2048/(2048+144) \approx 93.4\%\). At \(\mu=4\) (SCS = 240 kHz) the extended CP reduces efficiency further. For ultra-dense small-cell deployments, even this 6–7% loss translates to meaningful capacity reduction.
9.1.2 High Out-of-Band Emissions (Sinc Sidelobe Problem)
The rectangular window applied to each OFDM symbol has a frequency-domain response of the form \(\mathrm{sinc}(fT)\). The first sidelobe is only \(-13.3\) dB below the main lobe. For a multi-carrier signal the composite power spectral density decays as:
This slow \(\sim 1/f^2\) rolloff forces regulators to mandate wide guard bands around the occupied spectrum — 0.5–1 MHz in LTE/5G NR — wasting radio resources. Adjacent-channel interference into other operators or asynchronous users (unlicensed, NB-IoT) is a direct consequence.
9.1.3 Peak-to-Average Power Ratio (PAPR)
An OFDM signal with \(N\) independently modulated subcarriers can constructively superpose. In the worst case:
For \(N=1024\) this is 30 dB — far in excess of what any practical power amplifier can tolerate linearly. Statistical measures (complementary CDF at \(10^{-3}\)) give PAPR \(\approx 10\text{–}11\) dB for QPSK/16-QAM, still requiring significant power back-off that degrades energy efficiency.
9.1.4 Sensitivity to Doubly-Dispersive Channels
CP-OFDM restores orthogonality only when the channel delay spread \(\tau_{\max} \le T_{CP}\) and the Doppler spread \(\nu_{\max} \ll \Delta f\). In vehicular or LEO-satellite channels both conditions can be violated simultaneously. Inter-carrier interference (ICI) from Doppler shifts destroys the diagonal structure of the channel matrix, requiring costly equalizers and reducing throughput.
9.2 FBMC — Filter Bank Multi-Carrier
FBMC replaces the rectangular pulse of OFDM with a well-designed prototype filter \(p(t)\) applied independently to each subcarrier. The transmitted signal is:
where \(d_{k,m}\) are real-valued OQAM symbols, \(T\) is the symbol period, \(\Delta f = 1/T\) is subcarrier spacing, and \(\varphi_{k,m} = \tfrac{\pi}{2}(k+m)\) is the phase offset used in Offset-QAM.
9.2.1 Prototype Filter Design — PHYDYAS Filter
The most widely studied prototype filter for FBMC is the PHYDYAS filter, specified by its frequency-domain coefficients. For overlapping factor \(K=4\) (filter spans 4 multicarrier symbol periods) the frequency samples are:
The prototype filter impulse response in continuous time has length \(L_p = K\cdot N\) samples (where \(N\) is the FFT size). The resulting power spectral density sidelobe is suppressed to below \(-40\) dB, compared to only \(-13\) dB for the rectangular window (OFDM).
9.2.2 OQAM — Offset QAM
The fundamental constraint of FBMC is that perfect reconstruction without a CP requires the transmit basis functions to satisfy the Nyquist condition only in the real field. Complex QAM symbols produce unavoidable intrinsic imaginary interference from neighboring subcarriers. OQAM circumvents this by transmitting the in-phase (I) and quadrature (Q) components of each QAM symbol on staggered time grids separated by \(T/2\):
where \(c_{k,n}\) is the complex QAM symbol on subcarrier \(k\) at time index \(n\). The real orthogonality condition for FBMC-OQAM is:
The imaginary interference \(\mathrm{Im}\{\langle g_{k,m}, g_{k',m'}\rangle\}\) from the nearest time-frequency neighbors is non-zero but known — it can be cancelled with auxiliary pilots or by treating it as structured noise in MMSE equalization.
- No CP — 100% spectral efficiency in time
- Sidelobe suppression >40 dB vs ~13 dB (OFDM)
- No guard bands needed between coexisting users
- Ideal for cognitive radio and fragmented spectrum
- Per-subcarrier filter design enables flexible resource allocation
- Only real-valued symbols per subcarrier (OQAM constraint)
- Latency: filter length \(= K \times N\) (4× symbol duration for \(K=4\))
- Channel estimation requires special pilot structures (scattered pilots contaminated by imaginary interference)
- MIMO extension is non-trivial (imaginary cross-talk between antennas)
- Not standardized in 3GPP; implementation complexity higher than OFDM
9.3 f-OFDM — Filtered OFDM
Filtered OFDM (f-OFDM) is a pragmatic variant that applies a single time-domain filter to each sub-band of contiguous subcarriers, rather than filtering at the individual-subcarrier level. The baseband model for sub-band \(b\) is:
where \(x_b(t)\) is the conventional CP-OFDM waveform for sub-band \(b\) and \(h_b(t)\) is a raised-cosine or windowed-sinc sub-band filter with a rolloff factor \(\beta\) and passband equal to the sub-band bandwidth.
9.3.1 Mixed Numerology in 5G NR
A key motivation for f-OFDM is the ability to multiplex different subcarrier spacings (\(\mu = 0, 1, 2, \ldots\) corresponding to SCS 15, 30, 60 kHz, …) in the same wideband carrier without mutual interference. Each numerology occupies its own sub-band and is filtered independently:
At the filter transition band (a few subcarriers wide), inter-sub-band interference (ISBI) exists. In 5G NR this is managed by the guard sub-band mechanism — typically 1–2 subcarriers are left unused at sub-band edges.
9.3.2 f-OFDM vs UFMC
| Property | f-OFDM | UFMC |
|---|---|---|
| Filtering granularity | Per sub-band (many RBs) | Per RB or small RB group |
| Filter length | Moderate (several symbols) | Short (\(L-1\) samples, \(L\ll N_{CP}\)) |
| CP retained? | Yes (within each sub-band) | No (filter provides delay tolerance) |
| Mixed numerology | Primary use case | Limited support |
| PAPR | Similar to OFDM | Similar to OFDM |
| Complexity | Low (B sub-band filters) | Higher (per-RB filters) |
9.4 UFMC — Universal Filtered Multi-Carrier
UFMC filters each resource block (RB) or small group of RBs individually using a Dolph-Chebyshev filter. The transmitted signal is the sum of per-RB filtered OFDM bursts:
where \(\mathbf{d}_i \in \mathbb{C}^{N_i}\) is the QAM symbol vector for RB \(i\), \(\mathbf{V}_i\) is the \(N \times N_i\) IDFT submatrix for the \(N_i\) active subcarriers of RB \(i\), and \(\mathbf{F}_i\) is the convolution (Toeplitz) matrix of the Dolph-Chebyshev filter of length \(L\). The total transmitted block length is \(N + L - 1\) samples — no CP is inserted.
9.4.1 Dolph-Chebyshev Filter Properties
The Dolph-Chebyshev filter minimizes the main lobe width for a given maximum sidelobe level (or vice versa). For a prescribed sidelobe attenuation \(A\) dB, the filter order \(L\) and window coefficients are determined analytically via:
where \(T_n(\cdot)\) is the Chebyshev polynomial of the first kind and \(x_0 = \cosh\!\bigl(\tfrac{1}{L-1}\cosh^{-1}(10^{A/20})\bigr)\). Typical UFMC parameters: \(L = 73\) (for a 5G NR 12-subcarrier RB at SCS 15 kHz), sidelobe attenuation 40 dB.
- Receive \(N + L - 1\) samples (no CP removal).
- Zero-pad to \(2N\) samples.
- Apply \(2N\)-point DFT.
- Take every other DFT output (decimation by 2): recovers the \(N\) subcarrier symbols on the desired RBs.
- Apply one-tap frequency-domain equalization per subcarrier (channel estimation needed; standard LS/MMSE pilots).
9.4.2 UFMC Target Scenarios
UFMC was proposed in 5GNOW (EU FP7 project) primarily for:
- NB-IoT and M-MTC: Low-cost devices with timing uncertainty — the Chebyshev filter tolerates asynchronous users within the same sub-band better than CP-OFDM.
- Short-burst communications: Removing the CP saves \(\sim 7\%\) overhead for short packets where control overhead already dominates.
- D2D and V2X: Asynchronous peer-to-peer links where strict uplink timing synchronization is impractical.
9.5 OTFS — Orthogonal Time-Frequency Space
OTFS is a modulation scheme proposed by Hadani et al. (2017) that places information symbols in the delay-Doppler (DD) domain rather than the time-frequency (TF) plane. The fundamental insight is that for typical wireless channels, the DD domain representation is sparse: each physical scatterer corresponds to a single (delay, Doppler) tap, making the channel extremely compact regardless of mobility.
9.5.1 Delay-Doppler Signal Model
Consider an \(M \times N\) OTFS frame. Symbols \(x[l,k]\) are placed on a DD grid with delay resolution \(\Delta\tau = 1/(M\Delta f)\) and Doppler resolution \(\Delta\nu = 1/(NT)\). The modulation chain is:
\(X[n,m]\) is the time-frequency domain representation, which is then converted to a time-domain waveform via the Heisenberg transform (a standard OFDM modulator):
where \(g_{tx}(t)\) is the transmit pulse (rectangular for OTFS-CP). At the receiver, the Wigner transform demodulates back to TF, and a 2D-SFFT transforms to DD:
This is a 2D circular convolution with the DD domain channel \(h_w[l',k']\) — a sparse matrix with at most \(P\) non-zero entries (one per scattering path). This sparsity underpins OTFS's efficiency advantage for channel estimation and equalization.
9.5.2 Full Diversity Property
Every OTFS symbol \(x[l,k]\) spreads its energy across the entire time-frequency plane via the 2D-ISFFT. As a result, each symbol experiences the full diversity of the channel — all \(P\) delay-Doppler paths contribute to each received DD-domain sample. The diversity order is:
In contrast, a single CP-OFDM subcarrier experiencing fading has diversity order 1 (per subcarrier) without coding or interleaving. This is why OTFS shows dramatically better BER at high mobility — it converts fading dips into a manageable average-SNR problem.
9.5.3 Relationship to CP-OFDM and PAPR
OTFS is mathematically equivalent to CP-OFDM preceded by a 2D-ISFFT precoding matrix. This means:
- PAPR: Identical to CP-OFDM (the precoding is unitary; it redistributes symbols across TF but does not reduce peak-to-average ratio statistically).
- Implementation: An OTFS modulator can be built on top of an existing OFDM stack with a software precoding layer.
- Guard intervals: OTFS-CP uses one CP per OTFS frame (not per symbol), recovering most of the CP overhead of symbol-by-symbol OFDM.
9.5.4 DD Channel Estimation
Since the channel is sparse in DD domain (\(P \ll MN\)), pilot-based estimation requires far fewer pilots than OFDM. A single impulse pilot at DD coordinate \((l_p, k_p)\) with guard region produces:
The guard region of size \((2l_{\max}+1)\times(2k_{\max}+1)\) around the pilot isolates the channel response from data interference. The pilot overhead scales with the channel spread, not the frame size, giving significant savings for sparse doubly-dispersive channels.
9.6 Waveform Comparison
| Waveform | CP overhead | OOB emissions | PAPR | High-mobility | Complexity | Standard |
|---|---|---|---|---|---|---|
| CP-OFDM | ~7% (\(\mu=0\)) | Poor (~−13 dB sidelobe) | High (~10–11 dB) | Poor (ICI at high \(\nu\)) | Low (DFT) | LTE, 5G NR, Wi-Fi |
| FBMC-OQAM | 0% (no CP) | Excellent (>40 dB) | High (same as OFDM) | Poor (long filter latency) | High (per-SC filter bank) | None (research/cognitive radio) |
| f-OFDM | ~7% (CP retained) | Good (>30 dB, sub-band) | High | Moderate | Low–Medium (B filters) | 5G NR research; not standardized |
| UFMC | 0% (no CP) | Good (>30–35 dB) | High | Moderate | Medium (per-RB filters) | 5GNOW proposal; NB-IoT studies |
| OTFS | ~1 CP/frame (<1%) | Same as CP-OFDM | High (same as OFDM) | Excellent (full diversity) | Medium (2D-ISFFT + OFDM) | 6G research; 3GPP NTN study |
9.7 OQAM Nyquist Condition for the Prototype Filter
The prototype filter \(p(t)\) must satisfy the real Nyquist condition to guarantee zero inter-symbol and inter-carrier interference in FBMC-OQAM. In the Zak (or Wigner-Ville) domain, this condition is:
Equivalently in the frequency domain (Balian-Low constraint):
This is the half-Nyquist (power complementary) condition. The PHYDYAS filter satisfies (9.19) up to a very good approximation for \(K \geq 4\). The ISOTROPE filter satisfies it exactly with a closed-form expression.
9.8 Visualizations
9.9 Study Questions
- FBMC vs CP-OFDM guard bands: A cognitive radio system must coexist with a licensed user 2 subcarriers away. CP-OFDM sidelobes at 2 subcarrier offset are approximately \(-20\log_{10}(2\pi \times 2) \approx -22\) dB below the main lobe. The PHYDYAS filter (\(K=4\)) achieves >40 dB suppression at the same offset. If the licensed user's interference tolerance is −30 dBc, explain why CP-OFDM requires a guard band of at least 4–5 subcarriers while FBMC-OQAM can operate with zero guard subcarriers. What constraint does this place on the FBMC frame latency if SCS = 15 kHz and \(K=4\)?
- OTFS diversity order: A channel has \(P=3\) scattering paths. An OTFS frame uses \(M=16\) delay bins and \(N=8\) Doppler bins. Using equation (9.16), what is the maximum diversity order available to each symbol? Contrast this with coded CP-OFDM (coding gain but not modulation diversity) and explain why OTFS BER curve slope is steeper. Under what condition does OTFS diversity collapse to match OFDM (hint: consider the rectangular pulse guard region in the DD domain)?
- UFMC vs FBMC latency trade-off: Both UFMC and FBMC remove the CP and reduce OOB emissions. UFMC uses a Dolph-Chebyshev filter of length \(L=73\) samples at SCS 15 kHz (FFT size \(N=2048\)). FBMC-OQAM with \(K=4\) has a prototype filter of length \(L_p = K \times N = 8192\) samples. Calculate the additional filter-induced latency in microseconds for each scheme. For a URLLC application requiring end-to-end latency < 1 ms, which waveform is disqualified and why? What is the minimum practical overlapping factor \(K_{\min}\) for FBMC to remain within this latency budget at SCS = 15 kHz?
OFDM Pros & Cons — Comprehensive Analysis
This section provides a rigorous, balanced assessment of CP-OFDM — the waveform that underpins 4G LTE, 5G NR, Wi-Fi 4–7, and most modern broadband standards. Every advantage carries a quantified engineering justification; every disadvantage is paired with its real-world impact and the mitigations that have been standardised or researched. The section closes with a deployment decision matrix and two interactive charts.
1 Advantages of CP-OFDM
1a Multipath Resilience via Single-Tap Equalization
The cyclic prefix converts the linear channel convolution into a circular one. In the frequency domain this means that the received subcarrier k sees a multiplicative scalar channel coefficient rather than inter-symbol interference:
Equalising each subcarrier therefore costs exactly one complex multiplication per subcarrier per OFDM symbol, regardless of the channel delay spread. Compare this with the brute-force single-carrier (SC) alternative:
where \(M\) is the constellation order and \(J\) is the number of channel taps. For a 100-tap LTE Urban-Macro channel with 64-QAM (\(M=64\)):
- OFDM: 100 complex multiplications (one per tap-equivalent subcarrier)
- SC + Viterbi: \(64^{100} \approx 10^{180}\) states — computationally impossible.
1b Efficient FFT-Based Implementation
The modulator/demodulator is a single IFFT/FFT pair. The Cooley–Tukey radix-2 FFT reduces the multiply count from \(N^2\) (direct DFT) to \(\frac{N}{2}\log_2 N\) complex multiplications:
For N = 2048 (LTE 20 MHz / 5G NR μ=0 at 30.72 Msps):
| Method | Operations | Ratio |
|---|---|---|
| Direct DFT (N²) | 4,194,304 | 1× |
| Radix-2 FFT | 22,528 | 186× fewer |
For N = 4096 (400 MHz NR, μ=3) the savings are even larger: 8,192 vs 16,777,216 — a 2048× reduction. This makes real-time 100+ MHz OFDM feasible on a single embedded DSP or FPGA slice.
1c Flexible Frequency-Domain Resource Allocation (OFDMA)
Because each subcarrier is modulated independently, the scheduler can allocate different subsets of subcarriers to different users, power levels, and MCS in each OFDM symbol. This enables three powerful capabilities:
- Water-filling capacity: allocate power inversely proportional to noise level per subcarrier. Theoretical capacity approaches the Shannon limit: \[C = \sum_{k} B_\text{sc} \log_2\!\left(1 + \frac{P_k |H[k]|^2}{\sigma^2}\right)\]
- Frequency-selective multi-user scheduling: assign subcarrier group \(k\) to the user whose channel \(|H_u[k]|\) is largest — multiuser diversity gain scales as \(\log\log K\) for \(K\) users (Knopp–Humblet theorem).
- Non-contiguous spectrum access: subcarriers in a fragmented licensed band or cognitive-radio secondary band can be individually toggled, something impossible with a single-carrier system without a full redesign.
1d Easy MIMO Implementation
At each subcarrier the channel is flat-fading: the MIMO channel matrix \(\mathbf{H}[k] \in \mathbb{C}^{N_r \times N_t}\) is constant across that subcarrier's bandwidth. Spatial multiplexing therefore reduces to a simple matrix equation per subcarrier:
For massive MIMO with \(N_t = 64\) transmit antennas and a single-stream UE (\(N_r = 1\)), the per-subcarrier equalizer is 64 scalar multiplications — one weight per antenna. Contrast this with a wideband single-carrier massive MIMO equalizer which requires a full \(64 \times J\)-tap matrix filter per receive antenna.
1e Scalable Bandwidth and Numerology
The OFDM framework parameterises cleanly: subcarrier spacing \(\Delta f = 1/T_u\), FFT size \(N\), CP length \(N_\text{CP}\), and sampling rate \(f_s = N \cdot \Delta f\). 5G NR defines five numerologies sharing the same baseband architecture:
| μ | Δf (kHz) | Nominal FFT (FR1) | Max BW (MHz) | Use case |
|---|---|---|---|---|
| 0 | 15 | 2048 | 50 | LTE-compat, eMBB sub-6 |
| 1 | 30 | 4096 | 100 | eMBB sub-6 primary |
| 2 | 60 | 4096 | 200 | FR1 high-capacity / mmWave FR2 |
| 3 | 120 | 4096 | 400 | FR2 eMBB / V2X low-latency |
| 4 | 240 | 4096 | 400 | FR2 reference signal only |
Only the FFT size, CP tap count, and sampling clock change between numerologies. The IFFT/FFT hardware IP block is identical — a major silicon cost saving.
1f Spectral Efficiency
Orthogonality allows subcarriers to overlap in frequency while remaining inter-symbol interference-free — eliminating the guard bands required between single-carrier channels:
With adaptive modulation and coding (AMC) per subcarrier, OFDM approaches the Shannon bound. Practical LTE/NR peak spectral efficiencies (downlink):
- LTE Cat-4 SISO: ≈5.1 bit/s/Hz at 64-QAM R=4/5
- LTE Advanced 4×4 MIMO: ≈16 bit/s/Hz (4-layer 256-QAM)
- 5G NR 8×8 DL MIMO: up to ≈30 bit/s/Hz (8-layer 256-QAM 5/6)
2 Disadvantages of CP-OFDM
2a High Peak-to-Average Power Ratio (PAPR)
An OFDM signal is the sum of \(N\) independently modulated sinusoids. By the central limit theorem the real and imaginary parts approach Gaussian distributions as \(N \to \infty\), giving a CCDF tail that decays slowly. The theoretical maximum PAPR is:
For N = 2048 this is 33 dB, but the practical 10-4 CCDF value is 10–12 dB. The consequences for the power amplifier (PA) are severe:
| PA back-off | PA efficiency (typical class-AB) | Comment |
|---|---|---|
| 0 dB (at 1-dB comp.) | ~50% | Heavy clipping / distortion |
| 3 dB | ~25% | Marginal linearity |
| 10 dB (required for OFDM) | ~5% | Standard LTE downlink PA |
A 10 dB back-off reduces PA drain efficiency from ~50% to ~5% — a 10× increase in power consumption for the same useful RF output. This is why 3GPP chose DFT-s-OFDM (SC-FDMA) for the LTE/NR uplink: the single-carrier nature reduces PAPR by 4–6 dB, cutting UE battery drain significantly.
- DFT-s-OFDM: spreads data across all subcarriers via DFT pre-coding → 4–6 dB PAPR reduction; used in NR UL.
- Clipping & filtering: clip at 8–9 dB PAPR threshold → ≈2–3 dB gain at cost of EVM floor ≈−30 dB.
- SLM / PTS: Selected Mapping / Partial Transmit Sequences → 3–4 dB gain; requires side information overhead (<2 bits/symbol).
- Tone reservation: reserve ≈2% subcarriers for PAPR cancellation tones (IEEE 802.11af approach).
2b Cyclic Prefix Overhead
The CP carries no new data — it is a copy of the tail of the OFDM symbol prepended to guard against ISI. Every CP sample is "wasted" bandwidth:
Concrete values:
| Standard | N | NCP (normal) | Overhead per symbol | Effective average overhead |
|---|---|---|---|---|
| LTE normal CP | 2048 | 144 (6 of 7 symbols) / 160 (1st symbol) | 144/2192 = 6.57% | ≈8.3% (slot average) |
| LTE extended CP | 2048 | 512 | 512/2560 = 20% | 20% (all symbols) |
| 5G NR μ=0 | 2048 | 144/160 | same as LTE | ≈7.1% |
| 5G NR μ=3 | 512 | 36/40 | 36/548 = 6.6% | ≈7.0% |
The overhead cannot be reduced below the channel's maximum excess delay divided by the sampling period without incurring ISI. For LTE, the normal CP covers \(N_\text{CP} \times T_s = 144 / 30.72\text{ MHz} = 4.69\;\mu\text{s}\) — sufficient for macro-cell delay spreads up to ≈1.4 km path-length difference.
2c Sensitivity to CFO and Phase Noise
A carrier frequency offset (CFO) \(\varepsilon\) normalised to subcarrier spacing (\(\varepsilon = \delta f / \Delta f\)) breaks orthogonality, causing inter-carrier interference (ICI). The SIR from ICI is approximately:
Example: a 750 Hz CFO on a 15 kHz subcarrier grid (\(\varepsilon = 0.05\)):
This limits the useful SNR to 20.8 dB — insufficient for 256-QAM (requires >30 dB SIR). Real oscillator specs must achieve sub-10 Hz residual CFO after AFC for 256-QAM links.
Phase noise at mmWave: LO phase noise power spectral density scales as \(f_c^2\) (for a free-running VCO). At 60 GHz vs 30 GHz the phase noise is 4× worse in power (6 dB) — and the wider 120 kHz / 240 kHz subcarrier spacing of 5G NR FR2 only partially compensates. 3GPP introduced Phase Tracking Reference Signals (PTRS) in NR FR2 specifically to track and cancel low-frequency phase noise.
- AFC loops: track residual CFO to <1% of Δf.
- PTRS (5G NR TS 38.211 §7.4.1.2): dense phase-error pilots in time; spans 1 in every 4 OFDM symbols at high MCS.
- Wider SCS: larger Δf → larger tolerable absolute CFO at same normalised ϵ.
- Phase-locked oscillators: tighter PLL bandwidth reduces integrated phase noise.
2d Out-of-Band Emissions (OOB)
The OFDM time-domain symbol has a rectangular (abrupt) window. Its frequency spectrum is a sum of sinc functions, with the first sidelobe at only −13 dB relative to the main lobe:
To protect adjacent channels, guard bands must be inserted. LTE allocates approximately 10% of the channel bandwidth as guard subcarriers (e.g., 72 unused subcarriers at each edge of the 2048-point FFT in 20 MHz mode). These are in addition to any regulatory spectral mask requirements.
- Windowing (raised-cosine / Hann): roll-off at symbol boundary → sidelobe ≈ −40 dB at cost of ≈0.5% CP length for ramp.
- f-OFDM (filtered OFDM): per-subband bandpass filter; used in some 5G NR mixed-numerology proposals.
- FBMC (Filter Bank Multicarrier): per-subcarrier Nyquist filter; achieves −80 dB OOB but requires twice the prototype filter length (see §9).
- DFT-spread OFDM + spectrum shaping: the single-carrier envelope reduces OOB by ≈6–8 dB vs CP-OFDM for equivalent output power.
2e Poor Performance in Doubly Dispersive Channels
CP-OFDM handles delay spread (frequency selectivity) via the CP, but it handles Doppler spread (time selectivity) only if the channel is quasi-static within one OFDM symbol duration \(T_u\). The ICI caused by Doppler shift \(f_D\) is approximately:
For a 5G NR μ=0 symbol at 250 km/h and 3.5 GHz: \(f_D = 250 \times 3.5 \times 10^9 / (3 \times 10^8 \times 3.6) \approx 810\;\text{Hz}\), and \(T_u = 66.7\;\mu\text{s}\), giving \(f_D T_u \approx 0.054\) and SIR ≈ 20.4 dB — borderline for 256-QAM.
When both delay spread and Doppler are significant (doubly dispersive), the CP may be insufficient (ISI) and the symbol duration may be too long (ICI) simultaneously. Increasing \(\Delta f\) (shorter symbol) reduces ICI but also reduces the maximum tolerable delay spread.
- Higher SCS: μ=3 (120 kHz) at mmWave → shorter symbol → more Doppler robust.
- OTFS (Orthogonal Time Frequency Space): modulates in the delay-Doppler domain → full diversity for doubly dispersive channels (see §9).
- Robust pilot patterns: denser pilot grids for high-mobility channels (e.g., NR PDSCH DMRS type-B with additional CDM groups).
- V2X slot design: 3GPP Rel-17 NR-V2X uses the 60 kHz SCS specifically to handle high Doppler in vehicular scenarios.
2f High ADC/DAC Complexity for Wide Bandwidth
OFDM exploits bandwidth by increasing the number of subcarriers, which requires proportionally higher sampling rates. The Nyquist sampling rate for a 400 MHz NR carrier (μ=3, FFT=4096) with standard oversampling is:
After 2× oversampling for interpolation filtering: 983.04 Msps. ADC power consumption scales roughly as \(f_s \times 2^\text{ENOB}\), so a 12-bit ADC at 1 Gsps consumes ≈500 mW — substantial in a battery-powered UE. Contrast with NB-IoT (180 kHz, μ=0): ADC at 240 ksps draws <1 mW.
3 Net Verdict: Why OFDM Won — and Where It Falls Short
CP-OFDM won the 4G/5G standardisation process for a combination of reasons that no single-carrier alternative could simultaneously match in the mid-2000s engineering environment:
- Equalizer simplicity: one multiplication per subcarrier vs exponential SC complexity enabled real-time 20 MHz 2×2 MIMO on the <65 nm silicon of 2008.
- OFDMA multi-user flexibility: sub-carrier granularity enabled the LTE scheduler architecture that drives the 3–5× throughput gains of LTE over HSPA.
- MIMO synergy: per-subcarrier flat-fading made 4×4 MIMO SIC/SU-MIMO tractable without a joint space-time processor.
- Ecosystem momentum: ADSL, 802.11a/g, DVB-T all used OFDM; VLSI toolchains, test equipment, and channel models were already OFDM-native.
Where it falls short: OFDM is sub-optimal for (a) high-mobility vehicular/aerial channels (>500 km/h), (b) UE uplink power efficiency, (c) uncoordinated adjacent-channel coexistence due to OOB, and (d) future THz channels with extreme phase noise. The industry has addressed these with DFT-s-OFDM (UL), PTRS (mmWave), windowed/filtered variants (adjacent-channel), and is actively researching OTFS and AFDM for 6G high-mobility scenarios.
4 Deployment Decision Matrix
| Scenario | Recommended Waveform | Key Reason | Standard |
|---|---|---|---|
| High-mobility V2X (>250 km/h) | OTFS or NR μ=3 | Doppler > 1 kHz → ICI floor in CP-OFDM; OTFS spreads energy over full delay-Doppler grid | 3GPP Rel-17 NR-V2X; OTFS research (IEEE 2017+) |
| UE uplink (battery-constrained) | DFT-s-OFDM | 4–6 dB PAPR reduction → PA back-off ≈6 dB → 2× better PA efficiency | LTE SC-FDMA; 5G NR PUSCH DFT-s (TS 38.211) |
| mmWave gNB downlink | CP-OFDM + PTRS | Massive MIMO flat-fading per SC; PTRS corrects LO phase noise at 28/39 GHz | 5G NR FR2 (TS 38.211 §7.4.1.2) |
| Cognitive / secondary radio | FBMC-OQAM | OOB −80 dB via prototype filter; no CP needed for intra-band coexistence | IEEE 802.22; 5GNOW project |
| 6G THz / ISAC | OTFS + ISAC (TBD) | Extreme Doppler; radar ambiguity function requirements; delay-Doppler native sensing | ITU-R IMT-2030; ongoing 3GPP study item |
| IoT narrowband (NB-IoT) | Narrow-CP-OFDM | Simplicity; reuses LTE numerology (μ=0, 15 kHz); single-subcarrier option for lowest power | 3GPP TS 36.211 (NB-IoT), Rel-13+ |
| Wi-Fi 7 (IEEE 802.11be) | CP-OFDM + 16384-FFT | 4096 subcarriers, 320 MHz BW; OFDMA multi-link operation; indoor → low mobility OK | IEEE 802.11be (2024) |
- PAPR and PA efficiency: A 5G NR gNB uses a class-AB PA with peak efficiency 48% at saturation. The OFDM signal requires 8 dB back-off. Using the quadratic back-off model \(\eta(p) = \eta_\text{sat}\,(p/p_\text{sat})\), calculate the operating PA efficiency and the wasted power fraction. How much does DFT-s-OFDM (4 dB PAPR reduction) improve this?
- CFO budget allocation: A 5G NR NR-DC link operates at 3.5 GHz with 30 kHz SCS. The system requires SIR ≥ 30 dB on 256-QAM. Using the ICI SIR formula \(\text{SIR} \approx 3/(\pi^2\varepsilon^2)\), derive the maximum tolerable normalised CFO \(\varepsilon\). Convert this to an absolute frequency error in Hz and state whether a typical TCXO (±0.5 ppm) satisfies this budget.
- Waveform design trade-off: You are designing the PHY for a 6G high-altitude platform station (HAPS) link at 26 GHz with UE speed up to 1000 km/h and a delay spread of 2 μs. (a) Compute the coherence bandwidth and minimum subcarrier count for 100 MHz system bandwidth. (b) Compute the maximum Doppler frequency. (c) Evaluate whether CP-OFDM with μ=3 can handle this channel, or whether OTFS is required. Justify using both the ICI SIR formula and the Zadoff criterion \(f_D T_u \ll 1\).
6G Outlook — THz, AI-Native Waveforms, ISAC, and Beyond OFDM
The fifth generation of cellular communications is not yet fully deployed, yet the research community and standards bodies are already defining its successor. The ITU-R IMT-2030 framework sets requirements so demanding that they challenge the fundamental assumptions behind CP-OFDM. This section surveys the key technology pillars of 6G: the physics of THz propagation, the delay-Doppler domain waveform OTFS designed for extreme mobility, AI/ML-native waveform design, the integrated sensing-and-communications paradigm, Reconfigurable Intelligent Surfaces, and the green-communications imperative — and asks where, precisely, OFDM and its successors stand in each.
11.1 6G Vision and IMT-2030 Targets (ITU-R WP5D)
ITU-R Working Party 5D published the IMT-2030 Framework Recommendation (M.2160, 2023) establishing the key performance indicators (KPIs) that candidate technologies must meet by 2030. The jump from 5G to 6G is more radical than the 4G-to-5G transition: several KPIs improve by two orders of magnitude, and entirely new capability classes — native sensing and native AI/ML — are added to the requirement set.
| KPI | 5G NR (IMT-2020) | 6G (IMT-2030 Target) | Improvement |
|---|---|---|---|
| Peak data rate | 20 Gbps (DL) | 1 Tbps (DL) | 50× |
| User experienced rate | 100 Mbps | 1 Gbps | 10× |
| U-plane latency | 1 ms (URLLC) | 0.1 ms | 10× |
| Reliability (BLER) | 10−5 | 10−7 | 100× |
| Connection density | 106 devices/km² | 107 devices/km² | 10× |
| Area traffic capacity | 10 Mbps/m² | 1 Gbps/m² | 100× |
| Sensing range resolution | N/A | Sub-centimetre | New capability |
| Sensing velocity resolution | N/A | Sub-mm/s | New capability |
| Energy efficiency | Baseline | 100× bits/Joule | 100× |
| AI/ML integration | Study item (Rel-18) | Native, normative | New paradigm |
The candidate frequency bands reflect the dual imperative of coverage and capacity:
- FR3 / Upper mid-band (7–24 GHz): Best balance of coverage and capacity for macro deployments; likely the primary 6G deployment band. The 7–15 GHz range in particular offers wide contiguous blocks unavailable in sub-6 GHz.
- Sub-THz (100–300 GHz): Ultra-wideband operation enabling Tbps peak rates in dense indoor or backhaul scenarios. Feasibility confirmed in lab; outdoor deployment remains a research challenge.
- THz (>300 GHz): Long-term research horizon. Sub-cm sensing resolution. Fundamental challenges in device technology, phase noise, and molecular absorption.
11.2 THz Band Challenges for OFDM
The THz band offers extreme bandwidth, but its propagation physics stress every assumption that makes CP-OFDM efficient. Understanding these constraints quantitatively is essential for any system designer working on 6G air-interface selection.
Molecular Absorption
Unlike lower-frequency bands where path loss follows a smooth power law, the THz band has sharply frequency-selective molecular absorption windows. Atmospheric O2 produces a strong absorption peak near 60 GHz (used deliberately for backhaul isolation in WiGig), while H2O vapour dominates at 183 GHz and 325 GHz. The implication for OFDM is a non-flat channel at the system level — even before multipath — requiring either careful subband selection or absorption-aware water-filling power allocation.
Phase Noise Scaling
Oscillator phase noise power spectral density grows as the square of the carrier frequency. A 300 GHz oscillator with state-of-the-art design exhibits approximately 100× worse phase noise than a comparable 30 GHz oscillator. In OFDM, phase noise causes two degradation modes:
- Common Phase Error (CPE): A time-domain phase rotation that applies identically to all subcarriers in one OFDM symbol — correctable with pilot-based phase tracking.
- Inter-Carrier Interference (ICI): Energy from one subcarrier leaking into adjacent subcarriers — fundamentally degrades SNR and is not correctable by simple pilots. ICI power scales roughly as the ratio of phase noise bandwidth to subcarrier spacing.
Ultra-Wideband Delay Spread
At 100 GHz of instantaneous bandwidth, a modest channel delay spread of \(\tau_{\max} = 50\,\text{ns}\) translates to 5,000 channel taps at the Nyquist rate. The CP duration must cover this delay spread, incurring significant overhead, and the equaliser must handle a very long channel — erasing the simplicity of one-tap per-subcarrier equalisation unless a grouped-subcarrier or OFDM-with-DFT-spreading approach is used.
Extreme Doppler at THz
At a carrier of \(f_c = 300\,\text{GHz}\) and pedestrian speed \(v = 100\,\text{km/h}\), the Doppler shift is:
Path Loss and Pencil Beamforming
Free-space path loss at 300 GHz over 10 m is approximately 102 dB — roughly 20 dB worse than at 30 GHz over the same distance. Atmospheric absorption adds further loss. Compensation requires extremely narrow beams: with \(N_{\text{ant}} = 1024\) elements, the array gain is \(10\log_{10}(1024) \approx 30\,\text{dB}\) — barely sufficient to close the link at short range. Beam management at THz must operate at microsecond timescales, far faster than 5G mmWave beam management.
11.3 OTFS for 6G High-Mobility Scenarios
Orthogonal Time Frequency Space (OTFS), proposed by Hadani et al. (2017), modulates information in the delay-Doppler (DD) domain rather than the time-frequency domain used by OFDM. In channels with extreme Doppler — LEO satellite communications being the canonical example — OTFS achieves near-ideal performance while OFDM's subcarrier orthogonality is destroyed.
LEO Satellite Channel: Motivating Numbers
OTFS Signal Model
OTFS uses a 2D transform — the Symplectic Finite Fourier Transform (SFFT) — to convert the time-frequency plane into the delay-Doppler plane, where the mobile channel appears as a sparse, separable (delay, Doppler) impulse response:
OTFS vs OFDM: Capacity and Diversity
- Ergodic capacity: OTFS and OFDM are theoretically equivalent in ergodic capacity — both achieve the Shannon limit given optimal coding and water-filling power allocation. The key difference is diversity.
- Diversity order: In a channel with \(P\) paths, OTFS achieves full delay-Doppler diversity order \(P\); CP-OFDM with a single-tap equaliser achieves diversity 1 per subcarrier (without coding). Uncoded BER curves differ dramatically.
- Channel estimation overhead: OTFS requires only a single pilot in the DD plane to estimate all \(P\) paths simultaneously (impulse pilot method). CP-OFDM requires pilot tones every \(\sim 14\) symbols in time and every few subcarriers in frequency, scaling with the channel coherence time and bandwidth.
- 3GPP NTN (Non-Terrestrial Networks): Rel-17 and Rel-18 introduced NTN support in NR, including Doppler pre-compensation at the UE. OTFS is actively studied as a candidate for 5G Advanced / 6G NTN air interface (SI: RP-230591).
11.4 AI/ML-Native Waveform Design
The conventional communications system design paradigm — derive an analytical channel model, engineer each block (modulation, channel coding, equalization, detection) separately to match the model — faces fundamental limitations in the complex, non-stationary channels of THz and high-mobility 6G. The emerging alternative is end-to-end learning: treat the transmitter and receiver jointly as an autoencoder and optimise them together via stochastic gradient descent.
Autoencoder Approach (Simonyan/Gruber Framework)
Key empirical findings from DeepSIG, OpenAirInterface (OAI), and the academic literature:
- AWGN channel: The autoencoder converges to constellations that closely resemble standard QAM/PSK — confirming that classical modulation is near-optimal in AWGN, and that the neural network rediscovers this independently.
- Fading channel (known to RX): The autoencoder adapts the constellation geometry and pulse shape to maximise diversity. The learned constellations deviate significantly from rectangular QAM at low SNR.
- Unknown or mismatched channel: End-to-end learning outperforms OFDM with model-based equalization when channel statistics deviate from the assumed model — a crucial advantage in THz channels with rapidly varying absorption and scattering.
- Generalization: A single autoencoder trained on a distribution of channels (e.g., stochastic CDL-A to CDL-E) generalises better than a fixed modulation scheme matched to one channel type.
3GPP AI/ML Channel Estimation — Rel-18 Study Item
3GPP Release 18 includes a study item (RP-213599) on AI/ML for NR physical layer enhancements, covering three use cases: (1) CSI feedback compression, (2) beam management, (3) channel estimation and equalisation. The Rel-18 study is not normative — it defines the evaluation framework and assesses feasibility. Normative AI/ML PHY enhancements are targeted for Rel-19/20.
Semantic Communication
Beyond bit-level optimisation, semantic communication transmits meaning rather than bits. For structured data (images, speech, sensor readings), joint source-channel coding at the semantic level achieves 10–100× compression over separate source coding + channel coding pipelines at the same reconstruction quality. This requires a shared semantic model between transmitter and receiver — a major departure from the bit-pipe model underlying all existing standards.
11.5 ISAC — Integrated Sensing and Communications
ISAC (also called Dual-Function Radar-Communication, DFRC) uses a single waveform, transceiver hardware, and spectrum for both wireless communications and radar-like environmental sensing. OFDM's structure — known pilots on a regular time-frequency grid — makes it a natural dual-purpose waveform.
OFDM Radar Processing
Unified ISAC Signal Model
ISAC Pareto Frontier and Trade-off
The achievable region for simultaneous communications (bit rate \(R\)) and sensing (estimation SINR \(\Gamma_s\)) forms a Pareto frontier parameterised by the power split between data and reference (pilot/sensing) symbols. Key results:
- Dedicating all power to data (standard OFDM) maximises \(R\) but reduces sensing SINR — pilots are sparse and sensing relies on random data correlations.
- Dedicating all power to sensing reference signals maximises \(\Gamma_s\) but \(R = 0\).
- The Pareto-optimal operating point depends on the specific use case. For automotive sensing at 79 GHz, a 10–20% pilot density sacrifice gives >30 dB sensing SINR with <5% capacity loss.
3GPP ISAC in 5G NR and 6G
- SRS-based monostatic sensing (Rel-19): The gNB transmits uplink SRS resources for the UE; in monostatic mode the gNB's own SRS transmission reflects off targets back to the same gNB. WI: NR_ISAC_Ph1 (RP-251881).
- PRS-based bistatic sensing: A reference gNB transmits PRS; a sensing gNB receives the reflected/scattered signal from targets. Uses the existing NR positioning reference signal infrastructure.
- 6G native ISAC: Rather than retrofitting sensing onto a communications waveform, 6G targets dual-function waveform design from the ground up: waveform basis, numerology, pilot pattern, and beamforming jointly optimised for both link performance and sensing accuracy KPIs.
11.6 RIS — Reconfigurable Intelligent Surfaces
A Reconfigurable Intelligent Surface (RIS) is a planar array of passive electromagnetic elements, each with a controllable reflection phase shift. Unlike an active relay, a RIS does not amplify or decode the signal — it passively steers reflected wavefronts. This enables coverage extension without additional RF chains, power amplifiers, or noise amplification.
OFDM + RIS Channel Model
The passive beamforming gain of an \(L\)-element RIS scales as \(L^2\) (coherent combination), compared to \(L\) for active arrays. This quadratic scaling is the primary motivation: a 256-element RIS provides 48 dB of passive beamforming gain, compensating for the 20 dB additional path loss of mmWave / sub-THz bands over microwave. Practical constraints include:
- Channel state information acquisition — the RIS is passive, so the controller must obtain CSI via active sounding through the RIS.
- Phase resolution — 1-bit quantised phase control (0° or 180°) gives approximately 3 dB loss vs continuous phase; 2-bit (4 phases) recovers most of the theoretical gain.
- Coupling and mutual impedance between adjacent elements at sub-wavelength spacing.
11.7 Energy Efficiency in 6G
The 100× bits-per-Joule target relative to 5G is the most challenging KPI in the IMT-2030 framework. The global ICT sector currently consumes approximately 2–3% of world electricity; with the projected 100× traffic growth from 5G to 6G, aggressive energy efficiency improvement is essential for sustainability.
OFDM Energy Overhead
CP-OFDM has inherent energy inefficiencies compared to narrowband alternatives:
- High PAPR (Peak-to-Average Power Ratio): OFDM signals with \(N\) subcarriers have PAPR growing as \(\log N\). Power amplifiers must be backed off from peak power, dramatically reducing their power-added efficiency (PAE). A typical 5G massive MIMO base station PA operates at 5–10% PAE due to OFDM PAPR; at 100% efficiency it would use 10–20× less energy for the same transmitted power.
- CP overhead: \(\mu = N_{\text{CP}} / (N + N_{\text{CP}})\) of all transmitted energy is discarded at the receiver.
- FFT processing energy: At 100 GHz bandwidth, the ADC, FFT, and baseband processing power is dominated by the FFT at \(O(N\log N)\) operations per OFDM symbol, at clock speeds approaching \(10^{11}\) operations/second.
Green 6G Alternatives
- Wake-up receivers: Secondary ultra-low-power receiver (1–10 μW) keeps the primary radio off until a wake-up signal is received, enabling near-zero idle power IoT devices.
- Ambient backscatter communication: Sensors modulate data onto reflected ambient RF energy (from existing 5G/WiFi signals) without any local oscillator or power amplifier. Achieves μW-level operation — incompatible with OFDM but viable for simple IoT sensor reporting at very low data rates.
- Dynamic spectrum access: Cognitive-radio-style spectrum sharing reduces the power consumed in idle monitoring of spectrum.
- Near-zero-energy IoT: Energy harvesting from RF, light, or mechanical vibration, combined with ultra-narrowband (UNB) modulation — the opposite end of the design space from wideband OFDM.
11.8 Summary: OFDM Evolution Across Generations
| Generation | Primary Waveform(s) | Key OFDM Innovation | Peak Rate | Bands | New Capability |
|---|---|---|---|---|---|
| 4G LTE | CP-OFDM (DL), DFT-s-OFDM (UL) | OFDMA multiple access; SC-FDMA for UL PAPR reduction | 1 Gbps | Sub-3 GHz | Mobile broadband at scale |
| 5G NR | CP-OFDM + DFT-s-OFDM | Flexible numerology (SCS 15–240 kHz); mmWave OFDM; massive MIMO | 20 Gbps | Sub-6 GHz, FR2 (24–52 GHz) | URLLC, eMBB, mMTC |
| 5G Advanced | CP-OFDM + ISAC extensions | Sensing RS (SRS/PRS-based); dual-function pilot patterns; AI/ML PHY (study) | 20+ Gbps | Sub-6 GHz, FR2, FR3 (7–24 GHz, study) | Native positioning; sensing feasibility; AI/ML study |
| 6G (IMT-2030) | OTFS / CP-OFDM / AI-native / UNB-IoT | Native ISAC waveform; DD-domain modulation; end-to-end learned waveforms; RIS integration | 1 Tbps | FR3, sub-THz (100–300 GHz), THz (>300 GHz) | Native sensing; native AI/ML; native energy efficiency; NTN integration |
Study Questions
- THz OFDM design trade-off: At a carrier frequency of 300 GHz with 100 GHz bandwidth, derive the required CP length (in samples) if \(\tau_{\max} = 30\,\text{ns}\), and compute the resulting CP overhead fraction for subcarrier spacings of 480 kHz and 960 kHz (5G NR numerologies 5 and 6). Given that phase noise ICI power scales as \(f_c^2 \cdot T_s^2\), how does doubling the SCS affect the ICI-vs-CP-overhead trade-off, and at what SCS does ICI become the dominant impairment rather than CP overhead?
- OTFS diversity advantage: A LEO satellite channel at 30 GHz has 4 paths with delays \(\ell \in \{0, 1, 3, 7\}\) delay bins and Doppler shifts \(k \in \{0, 5, -3, 12\}\) Doppler bins on a \(32 \times 32\) DD grid. Sketch the 2D channel response in the delay-Doppler plane. What is the full diversity order achievable by OTFS (assuming ideal pulse shaping and optimal coding)? Why does CP-OFDM with a one-tap equaliser achieve diversity order 1 in this channel, and what coding/equalisation approach would be needed to recover full diversity with OFDM?
- ISAC Pareto frontier: Consider an OFDM ISAC system with \(N = 1024\) subcarriers, \(M = 14\) symbols per slot, and total power \(P_T\). Let a fraction \(\alpha \in [0,1]\) of resource elements carry data and \((1-\alpha)\) carry sensing reference signals (known waveform). Express the communications spectral efficiency \(R(\alpha)\) and the sensing Fisher information \(\mathcal{I}(\alpha)\) as functions of \(\alpha\). Show that \(R(\alpha)\) is concave and \(\mathcal{I}(\alpha)\) is monotonically decreasing in \(\alpha\), and determine the \(\alpha^*\) that maximises \(R + \lambda \mathcal{I}\) for a given Lagrange multiplier \(\lambda\) (sensing priority weight). What does the optimal \(\alpha^*\) imply for the design of 6G dual-function pilot patterns?
3GPP OFDM Signal Generation — 4G LTE & 5G NR
This section walks through the normative OFDM baseband signal models defined in 3GPP TS 36.211 (LTE) and TS 38.211 (NR), covering downlink CP-OFDM, uplink DFT-s-OFDM (SC-FDMA in LTE), and the Sounding Reference Signal (SRS) for both generations. Each model is accompanied by an interactive chart.
12.1 LTE Physical Layer — Parametric Overview TS 36.211 §5–6
LTE uses CP-OFDM on the downlink (PDSCH, PDCCH, etc.) and DFT-s-OFDM (SC-FDMA) on the uplink (PUSCH) to keep the UE power-amplifier back-off low. Key parameters are fixed across all bandwidths:
| BW (MHz) | NFFT | Nused | NRB | CPnormal (μs) | CPext (μs) | Tsymbol (μs) |
|---|---|---|---|---|---|---|
| 1.4 | 128 | 76 | 6 | 4.69 | 16.67 | 71.35 |
| 3 | 256 | 152 | 12 | 4.69 | 16.67 | 71.35 |
| 5 | 512 | 300 | 25 | 4.69 | 16.67 | 71.35 |
| 10 | 1024 | 600 | 50 | 4.69 | 16.67 | 71.35 |
| 15 | 1536 | 900 | 75 | 4.69 | 16.67 | 71.35 |
| 20 | 2048 | 1200 | 100 | 4.69 | 16.67 | 71.35 |
Subcarrier spacing is fixed at \(\Delta f = 15\,\text{kHz}\), yielding a useful symbol duration \(T_u = 1/\Delta f = 66.67\,\mu\text{s}\). The first CP of each 0.5 ms slot is slightly longer (5.21 μs) to maintain the 0.5 ms frame alignment.
12.2 LTE Downlink — CP-OFDM Signal Model TS 36.211 §6.12
The complex baseband OFDM time-domain signal for OFDM symbol \(\ell\) in subframe \(s\) is (TS 36.211 Eq. 6.12-1):
\[ s_{\ell}(t) = \sum_{k=-N_{\rm FFT}/2}^{N_{\rm FFT}/2 - 1} a_{k,\ell}\,e^{j2\pi k\Delta f\,(t - N_{\rm CP}T_s)}, \quad 0 \le t < (N_{\rm CP}+N_{\rm FFT})\,T_s \]where \(a_{k,\ell}\) is the complex modulation symbol on subcarrier \(k\), symbol \(\ell\); \(T_s = 1/(15000 \times 2048) \approx 32.55\,\text{ns}\) is the basic LTE time unit; \(N_{\rm CP}\) is the CP length in samples (160 for the first symbol, 144 for symbols 1–6 in a 0.5 ms slot). Implemented as IFFT + CP prepend:
\[ \mathbf{s}_\ell = \underbrace{\text{CP}}_{\text{last }N_{\rm CP}\text{ samples}} \| \; \text{IFFT}_{N_{\rm FFT}}\!\left(\mathbf{a}_\ell\right) \]The resource grid spans \(N_{\rm RB}\times 12\) subcarriers in frequency and 14 OFDM symbols per 1 ms subframe (normal CP).
12.3 LTE Uplink — DFT-s-OFDM (SC-FDMA) TS 36.211 §5.3–5.4
To reduce PAPR and relax the UE power amplifier, the LTE uplink uses DFT-spread OFDM. The transmit chain (TS 36.211 §5.3) is:
\[ \underbrace{d_0,\ldots,d_{M-1}}_{\text{QAM symbols}} \xrightarrow{\;\text{M-pt DFT}\;} \tilde{d}_0,\ldots,\tilde{d}_{M-1} \xrightarrow{\;\text{subcarrier map}\;} \underbrace{a_0,\ldots,a_{N_{\rm FFT}-1}}_{\text{spread over }N_{\rm FFT}} \xrightarrow{\;\text{IFFT}_{N_{\rm FFT}} + \text{CP}\;} s(t) \]The DFT step spreads each QAM symbol across all \(M\) subcarriers allocated to the UE, creating a single-carrier signal in the time domain with PAPR comparable to QPSK even for higher-order modulations. \(M\) must be a product of 2, 3, and 5 (e.g.\ 12, 24, 36, …, 1200) — TS 36.211 Table 5.3-1.
Localised (LFDMA) and interleaved (IFDMA) subcarrier mappings are supported. LTE uses exclusively LFDMA (contiguous allocation):
\[ a_{k} = \begin{cases} \tilde{d}_{k - k_0} & k_0 \le k < k_0 + M \\ 0 & \text{otherwise} \end{cases} \]The received signal after flat-fading channel \(H_k\) and OFDM demodulation:
\[ Y_k = H_k\,\tilde{d}_{k-k_0} + N_k, \quad k_0 \le k < k_0+M \]A single-tap frequency-domain equalizer is then applied before the M-pt IDFT to recover \(\hat{d}_m\).
12.4 LTE Sounding Reference Signal (SRS) TS 36.211 §5.5.3
The SRS is a UL reference signal used by the eNB for uplink channel sounding (wideband CSI acquisition, scheduling, beamforming). Key properties (TS 36.211 §5.5.3):
- Base sequence: Zadoff-Chu (ZC) of length \(M_{\rm SRS}\), cyclic-extended to fill the allocated bandwidth.
- Sequence: \(r^{(\alpha)}_u(n) = e^{j\alpha n}\,\bar{r}_u(n)\), where \(\bar{r}_u\) is the ZC root sequence for group \(u\) and \(\alpha = 2\pi n_{\rm cs}/12\) is the cyclic shift.
- Bandwidth: \(M_{\rm SRS} \in \{4,8,12,\ldots\} \times N_{\rm RB}\) subcarriers; up to 4 nested hopping bandwidths \(m_{\rm SRS,0..3}\).
- Subframe mapping: occupies the last OFDM symbol of the subframe; cannot coexist with PUCCH in same RBs.
- Frequency hopping: controlled by \(b_{\rm hop}\), allowing periodic wideband sounding via interleaved narrow allocations.
12.5 5G NR Physical Layer — Flexible Numerology TS 38.211 §4–5
5G NR uses CP-OFDM for both DL and UL, with DFT-s-OFDM as an optional UL mode. The key flexibility is numerology \(\mu\): subcarrier spacing scales as \(\Delta f = 2^\mu \times 15\,\text{kHz}\), TS 38.211 Table 4.3.2-1.
| μ | Δf (kHz) | Tu (μs) | CPnormal (μs) | Slots / subframe | Symbols / slot | Bands |
|---|---|---|---|---|---|---|
| 0 | 15 | 66.67 | 4.69 | 1 | 14 | FR1 (<1 GHz) |
| 1 | 30 | 33.33 | 2.34 | 2 | 14 | FR1 sub-6 GHz |
| 2 | 60 | 16.67 | 1.17 | 4 | 14 | FR1 / FR2 |
| 3 | 120 | 8.33 | 0.59 | 8 | 14 | FR2 (mmWave) |
| 4 | 240 | 4.17 | 0.29 | 16 | 14 | FR2 reference only |
All numerologies share the same 10 ms frame and 1 ms subframe boundary, enabling mixed-numerology operation within a single carrier (BWP switching). The slot duration is \(T_{\rm slot} = 14 \times (T_u + T_{\rm CP})\) for normal CP.
12.6 5G NR Downlink — CP-OFDM Signal Model TS 38.211 §7.4
The NR DL baseband signal is (TS 38.211 Eq. 7.4-1):
\[ s_{\mu,\ell}(t) = \sum_{k=0}^{N^{\mu}_{\rm FFT}-1} a^{(\mu)}_{k,\ell}\, e^{j2\pi\left(k - \tfrac{N^{\mu}_{\rm FFT}}{2}\right)\Delta f_\mu\,(t - N^{\mu}_{\rm CP}T_c)}, \quad 0 \le t < (N^{\mu}_{\rm CP}+N^{\mu}_{\rm FFT})\,T_c \]where \(T_c = 1/(\Delta f_{\rm ref}\cdot N^{\rm ref}_{\rm FFT}) = 1/(480\,\text{kHz}\times 4096) \approx 0.509\,\text{ns}\) is the NR basic time unit; \(\Delta f_\mu = 2^\mu \times 15\,\text{kHz}\); and \(N^\mu_{\rm FFT}\) scales with bandwidth class and numerology.
Typical NR FFT sizes (100 MHz FR1, μ=1, Δf=30 kHz):
\[N_{\rm FFT}^{\mu=1} = 4096 \;\text{(for 100 MHz FR1, SCS 30 kHz)}\]The NR resource grid spans up to 3300 subcarriers (275 RBs × 12) for 100 MHz / SCS 30 kHz — significantly wider than LTE, reflecting NR's flexible bandwidth part (BWP) architecture.
12.7 5G NR Uplink — CP-OFDM and DFT-s-OFDM TS 38.211 §6.3
Unlike LTE, NR supports both CP-OFDM and DFT-s-OFDM on the uplink
(TS 38.211 §6.3). The UE capability bit
transformPrecoding (TS 38.214 §6.1.3) selects between the two:
- CP-OFDM UL (default): same structure as DL; enables MIMO spatial multiplexing and supports up to 4 layers. Used when PA back-off is less critical (power class 3, FR2).
- DFT-s-OFDM UL (transform precoding enabled): same as LTE SC-FDMA; lower PAPR, better coverage at cell edge. Mandatory for power class 3 UL coverage enhancement; mandatory for PRACH.
The NR DFT-s-OFDM chain (TS 38.211 §6.3.1.4):
\[ y(n) = \text{IFFT}_{N_{\rm FFT}}\!\Bigl[ \underbrace{\text{FFT}_{M}\bigl(d_0,\ldots,d_{M-1}\bigr)}_{\text{DFT precoding}} \text{mapped to }[k_0, k_0+M) \Bigr] + \text{CP} \]NR DFT-s-OFDM adds a key enhancement: spectral shaping (π/2-BPSK) reduces PAPR further to ~0 dB overhead, enabling 5G to operate in extreme coverage scenarios (TS 38.211 §6.3.1.1, NR PUSCH enhancement Rel-16).
12.8 5G NR Sounding Reference Signal (SRS) TS 38.211 §6.4.1.4
NR SRS is significantly enhanced over LTE SRS (TS 38.211 §6.4.1.4):
| Feature | LTE SRS (TS 36.211) | NR SRS (TS 38.211) |
|---|---|---|
| Base sequence | ZC, length MSRS | ZC or low-PAPR type 1/2, length MSRS,b |
| Antenna ports | 1 | 1–4 (Rel-15), up to 8 (Rel-16+) |
| Symbol duration | 1 symbol (last in subframe) | 1–4 consecutive symbols (configurable) |
| Periodicity | 2–160 ms | 1–5120 slots (any numerology) |
| Bandwidth | 4 hopping levels (mSRS,0..3) | 4 hopping levels (BSRS∈{0,1,2,3}) |
| Use | UL channel sounding, scheduling | UL CSI, beamforming, beam management, codebook |
| Comb | comb-2 only (every other SC) | comb-2 or comb-4 |
The NR SRS sequence (TS 38.211 §6.4.1.4.2):
\[ r^{(p)}_{\rm SRS}(n) = e^{j\alpha_p n}\,\bar{r}_{u,v}(n\bmod M_{\rm ZC}), \quad n = 0, 1, \ldots, M_{\rm SRS,b}-1 \]where \(\alpha_p = 2\pi n^{(p)}_{\rm cs}/N_{\rm cs}\) is the cyclic shift for antenna port \(p\); \(\bar{r}_{u,v}\) is a ZC base sequence of group \(u\), base sequence \(v\); and \(N_{\rm cs} \in \{8, 12\}\) (comb 2/4). Mapped to every \(K_{\rm TC}\)-th subcarrier (\(K_{\rm TC} = 2\) or \(4\) for comb-2/4):
\[ a^{(p)}_{k_0 + 2k} = r^{(p)}_{\rm SRS}(k), \quad k = 0, 1, \ldots, \frac{M_{\rm SRS,b}}{K_{\rm TC}}-1 \]12.9 LTE vs NR OFDM Signal Generation — Head-to-Head
- LTE fixes NFFT = 2048 for 20 MHz while NR uses NFFT = 4096 for the same bandwidth. Why does the larger FFT improve performance in NR, and what is the cost in latency and processing complexity?
- An NR gNB serves a cell-edge UE in 5 MHz at μ=0. The UE selects DFT-s-OFDM UL with M=60 subcarriers. Derive the exact CP duration in nanoseconds and the slot duration in μs using TS 38.211 Table 4.3.2-1 values.
- NR SRS uses comb-4 with 4 antenna ports. How many SRS subcarriers per RB does each port occupy? Show that the 4 ports can be orthogonally separated in frequency using cyclic shifts alone, and identify the condition on Ncs for this to work.