Why Synchronisation Can Break Your Entire 5G Network

SyncE, PTP, grandmaster clocks, and the timing failures that cause mass call drops and handover failures

1. What Is Synchronisation — And Why 5G Is More Demanding Than LTE

Every 5G base station must be synchronised. Not approximately — precisely. For TDD (Time Division Duplex) operation, which covers the vast majority of 5G NR deployments in the GCC, all gNBs must transmit and receive in exactly the same time slots. When they do not, downlink transmissions from one cell bleed into uplink receive windows of adjacent cells. This is called base station-to-base station interference, and it degrades network performance across entire regions — not just individual sites.

The synchronisation requirement for 5G TDD is a phase alignment of less than 1.5 microseconds between adjacent cells. To put that in perspective, light travels 450 metres in 1.5 microseconds. This is a hardware-level precision requirement that the entire transport network must support, from the grandmaster clock in the data centre all the way to the RU on the tower.

LTE FDD did not have this problem — frequency division allows simultaneous transmit and receive without phase alignment. 5G TDD changed the game. Synchronisation moved from a nice-to-have to a network-critical function that transport architects must design and operate with the same rigour as routing and QoS.

2. Real Network Architecture — The Timing Distribution Chain

From GPS to RU: Every Node in the Chain

NodeRoleProtocolAccuracy Requirement
GNSS / GPS ReceiverPrimary time source — absolute time referenceGPS/PPS — feeds Grandmaster< 100 ns (GNSS disciplined)
Grandmaster Clock (GM)Distributes time via PTP to all downstream nodesIEEE 1588v2 PTP — ITU-T G.8275.1< 100 ns from GNSS
Boundary Clock (BC)Terminates PTP from GM, re-originates to downstreamPTP slave toward GM, master toward downstream< 1 µs accumulated error
Transparent Clock (TC)Passes PTP through, corrects for transit delayPTP with correction field updateMinimal — adds correction only
SyncE-enabled switch/routerFrequency synchronisation — locks to Ethernet symbol clockITU-T G.8261 / G.8262Frequency: ±4.6 ppb
RU / DUEnd consumer of timing — applies phase alignment to radioPTP slave — typically G.8275.1 profile< 1.5 µs from adjacent cell

The key distinction between SyncE and PTP is what they provide: SyncE delivers frequency synchronisation — the clock ticks at the right rate. PTP delivers phase and time synchronisation — the clock ticks at the right rate AND is aligned to absolute UTC time. 5G TDD requires both. SyncE provides the stable frequency reference, PTP rides on top to deliver phase alignment.

Full Timing Support vs Partial Timing Support

ITU-T G.8275.1 defines Full Timing Support (FTS) — every node in the transport path is PTP-aware and acts as a Boundary Clock or Transparent Clock. This is the recommended architecture for 5G TDD. It provides the best accuracy because every transit node corrects for its own forwarding delay.

G.8275.2 defines Partial Timing Support (PTS) — PTP passes through non-PTP-aware nodes as regular UDP traffic. This introduces packet delay variation (PDV) because routers apply variable queuing delays to PTP packets. PTS requires careful link design and is only viable with very short transport chains and low utilisation links.

Field Note: In a GCC operator with a mix of new and legacy aggregation switches, we often see a hybrid: FTS on the fronthaul and upper midhaul segments (where latency budget is tightest), and PTS on the backhaul segment to the core. This works if PDV on the backhaul segment is well-controlled — but it requires explicit measurement, not assumption.

3. Step-by-Step Real Flow — Timing Distribution in Oman

Step 1 — GNSS Receiver at Core DC

A GNSS receiver (GPS/GLONASS) at the Muscat Core DC acquires satellite signals and disciplines a local oscillator to absolute UTC time with accuracy better than 100ns. This feeds the Grandmaster Clock, which becomes the authoritative time source for the entire network.

Step 2 — Grandmaster Distributes PTP

The GM originates IEEE 1588v2 PTP Announce and Sync messages on its downstream ports toward the aggregation PE routers. Each Sync message carries a precise timestamp. The downstream Boundary Clocks use the two-step timestamp mechanism to calculate their offset from the GM and correct their local clocks.

Step 3 — Boundary Clocks at PE Routers

The aggregation PE routers at Seeb, Ruwi, and Barka act as Boundary Clocks. They terminate the PTP session from the GM, lock their local clocks to the received time, and re-originate PTP sessions toward the access layer switches and ultimately the DU/RU. Each BC introduces some residual time error — typically 50-200ns per hop on well-configured hardware.

Step 4 — SyncE on the Access Segment

The access switches and transport links between PE and DU run SyncE alongside PTP. SyncE locks the Ethernet physical layer clock to the upstream reference, providing a clean frequency reference. This ensures that even if PTP packets experience momentary queuing delays, the frequency base remains stable and PTP recovery is faster.

Step 5 — RU Acquires Phase Lock

The RU runs a PTP slave clock. It receives Sync messages from the DU (which may relay PTP) or directly from the access BC. It calculates its time offset, adjusts its local oscillator, and achieves phase lock within the required 1.5 µs budget. The 5G radio frame timing is now aligned with all other cells in the network.

Key insight: The total phase error budget of 1.5 µs must be shared across the entire chain from GM to RU. With 5-7 hops, each hop can contribute no more than 200-300 ns. Any single misconfigured BC or high-PDV link segment can consume the entire budget alone.

4. Practical Example — Timing Failure Causing Call Drops

An operator in the GCC reports widespread handover failures and call drops at multiple sites in a district, not a single site. RAN performance dashboards show increased DL interference and rising PDCCH errors. Individual site checks find nothing wrong with the radio configuration. The issue is eventually traced to timing.

SymptomRoot CauseHow Detected
Mass handover failures across 12 sitesBoundary Clock at Seeb PE lost PTP lock after software upgrade — BC entered holdover modePTP clock state monitoring: BC showed HOLDOVER for 4 hours
Holdover degradationLocal oscillator in BC drifted 2.1 µs over 4 hours — exceeded 1.5 µs budgetTime error (TE) monitoring showed TE > 1500 ns at DU level
TDD interference spikeAdjacent cells lost phase alignment — DL/UL slot collisionRAN interference reports showed self-interference pattern
ResolutionPTP session restored after manual BC restart — GNSS reference re-acquiredTE returned to < 200 ns, interference eliminated within 2 minutes

The critical failure point was the absence of real-time Time Error monitoring. The BC entered holdover silently. No alarm was generated. By the time RAN performance degraded visibly, the timing error had been accumulating for 4 hours across all sites served by that BC.

5. Key Parameters and Configurations

ParameterStandard / RequirementTypical Configured ValueImpact if Out of Spec
End-to-end Time Error (TE)ITU-T G.8271.1: < 1.5 µs at RUTarget: < 500 ns for marginTDD interference, handover failures, HARQ timing errors
Packet Delay Variation (PDV)G.8271.2: < 1 µs for PTS networksMeasure on live path with TWAMP or PTP probeHigh PDV = BC cannot correct accurately = TE accumulation
SyncE SSM quality levelG.8264: QL-PRC for GM-derived signalQL-PRC on GM output, QL-EEC1 on access switchesWrong QL = clock cascades to wrong source, frequency error
PTP profileG.8275.1 (FTS) or G.8275.2 (PTS)G.8275.1 on all fronthaul/midhaul segmentsG.8275.2 on high-utilisation links causes excessive PDV
BC holdover specificationG.8273.2 Class C: < 50 ns drift in 1000sUse Class C or better BCs on all aggregation hubsClass A/B BCs drift too fast in holdover — 1.5 µs exceeded quickly
BMCA (Best Master Clock Algo)IEEE 1588 BMCA — clock class, accuracy, priorityGM: clock class 6, priority1 128; BC: class 135 in holdoverWrong priority = incorrect GM election, suboptimal clock source

6. Common Issues in the Field

  • GNSS signal loss at GM — caused by antenna obstruction, cable damage, or RF interference. GM enters holdover mode. If holdover oscillator quality is poor (OCXO vs TCXO), phase error accumulates rapidly. Deploy dual GM with automatic switchover and monitor GNSS lock status continuously.
  • Asymmetric path delay — PTP assumes the forward and return paths have equal delay. If PTP Sync packets travel on a different physical path than Delay_Request packets (e.g. different fibre routes in a ring), the calculated offset is wrong. This is particularly common in ring topologies with asymmetric fibre lengths. Requires asymmetry correction configuration on BCs.
  • PTP packets queued with data traffic — if transport switches do not apply priority queuing to PTP packets (DSCP CS6 or EtherType 0x88F7 for L2 PTP), they get queued with data traffic. Under congestion, PTP packets experience variable delay — high PDV — and BC accuracy degrades.
  • Mixed PTP-aware and non-PTP-aware nodes — a legacy switch in the transport chain that does not support BC or TC passes PTP as regular UDP. The variable forwarding delay it introduces accumulates as PDV. The BC downstream sees degraded input and cannot achieve the required accuracy.
  • Holdover alarms not monitored — this is the most dangerous operational gap. A BC in holdover mode continues to distribute timing and appears functional. Without active monitoring of PTP clock state and TE, the degradation is invisible until RAN performance collapses.
  • SyncE SSM misconfiguration — Synchronisation Status Messages tell downstream nodes about clock quality. If SSM is disabled or misconfigured, a switch may lock to a low-quality clock source (e.g. a free-running oscillator) and propagate frequency error downstream.

7. Troubleshooting Approach

  • Start at the GM — verify GNSS lock status, clock class (should be 6), and PTP session state. A GM in holdover or freerun state is the source of most widespread timing failures.
  • Trace the PTP clock chain — on each BC in the chain, verify: clock state (LOCKED/HOLDOVER/FREERUN), upstream master identity, time offset from master, and mean path delay. A BC showing high offset or HOLDOVER state is the break point.
  • Measure Time Error at DU — use an external time interval analyser or the DU’s built-in PTP monitoring to measure TE against a GPS reference. This gives the actual end-to-end accuracy at the consumer of timing.
  • Measure PDV on the transport path — inject PTP probes or use TWAMP with timestamping to measure packet delay variation between GM and BC, and between BC and DU. PDV above 1 µs on any segment indicates a queuing or routing problem.
  • Check SyncE chain integrity — verify SSM quality levels on each link. Use: show synchronization or equivalent CLI. Ensure all links show QL-PRC or QL-EPRTC derived quality. Any link showing QL-DNU (Do Not Use) or QL-SEC indicates a problem.
  • Verify PTP packet priority — confirm PTP Sync and Delay_Request packets are in the priority queue on all transport nodes. Check for PTP packet drops in queue statistics during congestion periods.

8. Design Recommendations — Consultant Level

  • Deploy dual GNSS-disciplined GMs at geographically separate locations — test automatic switchover quarterly. A single GM failure in a TDD network causes immediate widespread outage. This is non-negotiable for any serious 5G deployment.
  • Use Class C or better Boundary Clocks at all aggregation hubs — Class C BCs (ITU-T G.8273.2) maintain < 50ns drift for 1000 seconds in holdover. Class A/B drift too fast for TDD requirements. Specify this in equipment procurement RFPs explicitly.
  • Enable PTP packet priority on all transport nodes — map PTP EtherType (0x88F7 for L2) or DSCP CS6 (for L3/UDP PTP) to the strict priority queue on every switch and router in the timing chain.
  • Deploy asymmetry correction on ring topologies — measure actual fibre delay asymmetry using a GPS receiver at both ends and configure correction values on BCs. Do not assume symmetric paths in ring networks.
  • Implement continuous TE and PDV monitoring — deploy a timing monitoring system that measures TE at the RU or DU level and alerts when TE exceeds 1 µs (giving 500 ns margin before the 1.5 µs limit). Do not rely on router/switch alarms alone.
  • Separate PTP traffic onto dedicated VLANs or use G.8275.1 full on-path support — never mix PTP with high-bandwidth data traffic on the same queuing domain without strict priority protection.

9. Summary — Practical Takeaways

Synchronisation is the silent killer of 5G TDD networks. When it fails, the symptoms look like radio problems — interference, handover failures, throughput drops — and the investigation invariably starts in the wrong place. By the time the transport team is involved, hours of service impact have already occurred.

The preventive measures are straightforward: dual GMs, Class C BCs, PTP priority queuing, asymmetry correction, and continuous TE monitoring. Every one of these must be designed in from the start. Retrofitting synchronisation infrastructure into an operational network is painful, expensive, and service-affecting.

TakeawayAction
1.5 µs phase budget is shared across all hopsBudget per hop: 200-300 ns max — verify with end-to-end TE measurement
Dual GM is mandatory, not optionalDeploy two geographically separate GNSS-disciplined GMs with auto-failover
BC holdover quality determines resilienceSpecify Class C (G.8273.2) BCs in all procurement — do not accept Class A/B
PTP packets must be in priority queueEnable PTP priority on every transport node — verify during acceptance testing
Monitor TE continuously, not reactivelyAlert at 1 µs TE — do not wait for RAN KPI degradation to detect timing issues

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top