Quick answer: Streaming hitches come from oversized cells, missing HLODs, and a loading range that lets the player reach an unloaded cell before it finishes streaming. Build HLODs, lower cell size, raise loading range, and enable async asset compilation to keep stream loads off the game thread.
Here is how to fix Unreal World Partition framerate hitches when crossing cell boundaries in an open world. The game runs at 60 FPS on flat plains, then drops to 5 FPS for half a second every time you enter a new region. The drop coincides with cell loading. World Partition is designed to keep memory bounded, but without HLODs and tuned grid settings, the per-cell load cost surfaces as visible hitches.
The Symptom
Smooth gameplay interrupted by ~100–500 ms hitches at predictable locations — usually the boundaries between Streaming Cells visible in the World Partition Editor grid. Profiling shows time spent in FWorldPartitionStreamingPolicy and asset loading on the game thread.
What Causes This
Cell size too large. Default cell size in the WP defaults to 25600 units (256 m). A cell this size can contain hundreds of actors and dozens of mesh assets. Loading them in one frame is the hitch.
No HLODs built. Without HLODs, distant content is either fully loaded (memory bloat) or absent (pop-in and hitch when it streams in). HLODs provide low-cost proxies that hide loads behind LOD transitions.
Loading range too tight. If the loading range matches the cell size, the player can reach the next cell before it finishes loading. Streaming runs as a high-priority blocking task to catch up, hitching the frame.
Synchronous asset loading. If a streamed actor uses FStreamableManager::LoadSynchronous or hard references to large assets, those loads block the game thread.
The Fix
Step 1: Build HLODs for the world. Open Window → World Partition → World Partition Editor. Select all cells (Ctrl-A), then click Build HLODs. For large worlds, use the commandlet so the build runs in batch on a build machine:
UnrealEditor MyProject.uproject ^
-run=WorldPartitionBuilderCommandlet ^
-Builder=WorldPartitionHLODsBuilder ^
-Map=L_OpenWorld
Step 2: Lower cell size. Open the level’s World Partition Settings. Reduce Cell Size to 6400 (64 m) for typical first-person scale, or even 3200 for dense urban scenes. Smaller cells mean more cells, but each load is cheaper and more granular.
Step 3: Raise the loading range. Set Loading Range higher than Cell Size by 2–3x. With cell size 6400 and loading range 16000, players have 1.5 cells of margin before reaching unloaded territory.
// Runtime tuning via console
wp.Runtime.SetActiveCellLoadingRange 16000
wp.Runtime.SetBlockOnSlowStreaming 1
wp.Runtime.EnableStreaming 1
Step 4: Use Runtime Grids for actor categorization. Add an extra grid named Foliage with cell size 12800 and loading range 25000 for ambient props. Use the default grid for gameplay-critical actors with smaller cell size. Each Actor’s World Partition properties expose a Runtime Grid dropdown.
Step 5: Convert hard refs to soft refs.
// Replace hard reference with TSoftObjectPtr
UPROPERTY(EditAnywhere)
TSoftObjectPtr<UStaticMesh> HeavyMesh;
void UseHeavyMesh()
{
UAssetManager& AM = UAssetManager::Get();
FStreamableManager& SM = AM.GetStreamableManager();
SM.RequestAsyncLoad(HeavyMesh.ToSoftObjectPath(), [this]() {
if (UStaticMesh* M = HeavyMesh.Get()) MeshComp->SetStaticMesh(M);
});
}
Async loads keep the heavy work off the game thread, even if the actor itself is streamed in synchronously.
Prestreaming Player Path
If your player follows predictable paths (rails, vehicle routes), use WorldPartitionStreamingSource with a forward-extending bounds. The streamer prefers cells in the predicted direction:
void AVehicle::Tick(float dt)
{
Super::Tick(dt);
StreamingSource->Location = GetActorLocation() + GetActorForwardVector() * 5000;
StreamingSource->Rotation = GetActorRotation();
}
Verifying With Stats
Run stat WorldPartition to see streaming time per frame. Hitches usually show up as spikes over 8 ms in the Cell Loading row. After tuning, that spike should stay under 2 ms even at full speed.
Understanding the issue
This bug class falls into a pattern that's worth understanding beyond the specific case. In Unreal Engine, the underlying behavior is shaped by how the engine layers its abstractions - the public API you call, the runtime systems that respond, and the platform-specific implementations underneath. A bug at any layer can produce symptoms that look like they originate at a different layer. Triaging effectively means recognizing which layer the symptom belongs to, even when the gameplay code is what's visible.
The specific bug described above is the kind that surfaces during integration rather than unit testing. It depends on a combination of factors: the asset configuration, the runtime state, the platform's specific behavior. In isolation, each piece looks correct; in combination, the bug emerges. This is why thorough integration testing - playing the actual game in realistic conditions - catches things that automated tests miss.
Why this happens
Bugs of this class are particularly easy to ship past internal QA because they often depend on specific runtime conditions - hardware combinations, network states, or asset configurations that QA didn't reproduce. Players hit them in the wild, file reports that are hard to repro, and the bug accumulates negative reviews while engineering tries to recreate the failure mode.
At the engine level, the behavior comes from a deliberate design decision in Unreal. The engine team chose a particular trade-off - usually performance versus convenience, or generality versus specificity - and that trade-off has consequences when you push against it. Understanding the trade-off is what turns 'this bug is mysterious' into 'this bug is the expected consequence of this design'.
Verifying the fix
After applying the fix, the verification step has three parts: confirm the original repro is resolved, confirm no obvious regressions in adjacent functionality, and (for shipping titles) deploy to a small player cohort first and watch the crash and report rates. Each step catches something the others miss.
Reproducibility is the prerequisite for verification. If you can't reliably reproduce the bug pre-fix, you can't reliably verify it post-fix. Spend time getting a clean reproduction before you write any fix code. The fix is fast once you understand the reproduction; the reproduction is the slow part.
Variations to watch for
There's almost always a less obvious case where the same problem applies. The reported case is the one a player hit; the related cases hide because they're rarer or affect fewer players. After fixing the reported case, search the codebase for the pattern - one fix often unlocks several.
Adjacent bugs often share a root cause. After fixing the case you've found, spend an hour searching the codebase for similar patterns. What's the same call with different arguments? The same data flow with a different entity type? The same lifecycle issue in a sibling system? Each match is a candidate for the same fix, or a related fix that prevents future bugs of the same class.
In production
For shipping titles with a long support window, watch for this issue resurfacing after dependency updates. Engine upgrades, driver updates, OS releases - each one can resurface a bug class you thought you'd fixed because the underlying behavior changed slightly. Regression tests catch the obvious ones; player reports catch the rest.
When triaging a similar issue in production, prioritize gathering data over hypothesizing causes. A player report describes a symptom; what you need is a build SHA, a session timestamp, and ideally a screen recording or session replay. With those, the bug becomes tractable. Without them, you're guessing at hypothetical reproductions that may not match what the player actually hit.
Performance considerations
Performance implications matter when this bug class scales with player count or asset count. A bug that fires once per session is annoying; a bug that fires once per frame compounds. After fixing, profile the affected code path under realistic load. The fix that's correct for one entity may be too slow for ten thousand.
Diagnostic approach
Diagnosing this class of bug benefits from a structured approach: confirm the symptom, isolate the variables, hypothesize the cause, and verify the hypothesis before writing fix code. Skipping the isolation step is the most common mistake; without it, fixes often address symptoms while the underlying cause continues to produce other variations.
For Unreal-specific diagnostics, the editor's profiler is the canonical starting point. Capture a representative frame with the symptom present; compare against a frame without the symptom; the diff often points directly at the cause. If the symptom is non-deterministic, capture multiple frames and look for the pattern - the cause is usually a state transition or a specific input value rather than a continuous effect.
Tooling and ecosystem
Modern engine versions ship better tooling for this kind of issue than older versions. If you're on an older release, the diagnostic step may take significantly longer because the tools you'd want don't exist yet. Sometimes the right answer is upgrading rather than fighting through limited tooling.
Within Unreal, the relevant diagnostic surfaces include the standard frame debugger, memory profiler, and engine-specific debug overlays. Each one shows a different facet of what's happening. The frame debugger reveals draw call ordering and state transitions; the memory profiler shows allocation patterns; the debug overlay reveals per-system state. Bugs that resist one tool usually surrender to another - the trick is knowing which tool to reach for first.
Edge cases and pitfalls
Edge cases for this class of issue often involve specific timing: the first frame after a state change, the last frame before a transition, frames where multiple subsystems update simultaneously. Reproducing these reliably is part of what makes the bug class hard to test.
When writing a regression test for this fix, focus on the boundary conditions that surfaced the original bug. Tests that exercise the happy path catch obvious regressions; tests that exercise the boundary catch the subtler regressions that look like new bugs but are really the original returning. The latter are the tests that earn their keep over the long life of the project.
Team communication
When this bug class affects multiple teams (often the case for cross-system issues), early communication prevents duplicate work. The team that owns the symptom may not own the cause. A 15-minute conversation at the start of triage often saves hours of independent investigation.
If this fix touches a system several engineers work in, a short writeup in the team's engineering channel helps. Not a full design doc - a paragraph explaining what was wrong, what's fixed, and what to watch for. Future engineers encountering similar symptoms will search for the fix; making it findable is a small investment that pays back later.
“HLODs hide the load. Smaller cells make the load cheap. Larger loading range hides any remaining cost. All three together keep frame time flat.”
Related Issues
For Level Instance loading issues, see Level Instance Not Loading. For Niagara streaming issues, see Niagara Not Rendering in PIE.
Build HLODs first. Then shrink cells. Then expand loading range. Hitches disappear in that order.