Double-Checked Locking — Optimize¶

Ten before/after walkthroughs that turn correct-but-suboptimal lazy-initialization code into cleaner, faster, or safer forms. Pair with middle, senior, professional.

Table of Contents¶

Lock-on-every-access → DCL
Two volatile reads → one local
DCL on a static → holder idiom
Hand-rolled singleton → enum
Volatile read on hot path → eager final
Lock on this → private lock object
Broken DCL → VarHandle acquire/release
C++ mutex-every-call → call_once
C++ call_once → magic static
Split flag+data → single volatile field
Optimization Tips

1. Lock-on-every-access → DCL¶

Before

static synchronized Conn get() { if (c == null) c = new Conn(); return c; }

Problem: Every read acquires the monitor, forever, though init happens once. After

private static volatile Conn c;
static Conn get() {
    Conn l = c;
    if (l == null) synchronized (Conn.class) { l = c; if (l == null) c = l = new Conn(); }
    return l;
}

Why: Removes the lock from the post-init fast path; only the one-time build is synchronized. Trade-off: Adds the famous correctness burden (volatile, two checks). Often the holder idiom (Opt 3) is a better target.

2. Two volatile reads → one local¶

Before

if (instance == null) { ... }
return instance;          // second volatile read on the fast path

Problem: The hot path reads the volatile field twice (check + return), each an acquire load. After

Singleton l = instance;   // one volatile read
if (l == null) { ... l = instance; ... }
return l;

Why: Halves volatile reads on the common path; on weak ISAs that's a real fence saved. Trade-off: None meaningful; standard idiom. Keep the local consistent inside the lock too.

3. DCL on a static → holder idiom¶

Before: A correct volatile DCL guarding a static singleton. After

private static class H { static final Singleton I = new Singleton(); }
static Singleton get() { return H.I; }

Why: Lazy + thread-safe by the JLS, no volatile, and the fast path is a plain read (no acquire fence) — strictly cheaper than DCL. Trade-off: Statics only; can't be used for instance fields.

4. Hand-rolled singleton → enum¶

Before: DCL or holder singleton needing serialization/reflection hardening. After

enum Singleton { INSTANCE; }

Why: JVM guarantees single, thread-safe init; immune to reflection and serialization duplication. Trade-off: Eager at class-load; can't extend a class.

5. Volatile read on hot path → eager final¶

Before: volatile DCL where construction is actually cheap. After

private static final Singleton INSTANCE = new Singleton(); // plain read, no fence

Why: If laziness buys nothing, eager static final is trivially correct and the fast path is a plain read. Trade-off: Loses laziness — only valid when early construction is acceptable.

6. Lock on `this` → private lock object¶

Before

synchronized (this) { ... }   // external code can contend on your monitor

After

private final Object lock = new Object();
synchronized (lock) { ... }

Why: Encapsulates the monitor so unrelated synchronized(yourObject) elsewhere can't deadlock or contend with your init. Trade-off: One extra field; negligible.

7. Broken DCL → VarHandle acquire/release¶

Before: Non-volatile DCL (broken) where you want explicit, documented ordering. After

T l = (T) VH.getAcquire(this);
if (l == null) synchronized (lock) {
    l = (T) VH.getAcquire(this);
    if (l == null) { l = build(); VH.setRelease(this, l); }
}
return l;

Why: Expresses exactly the needed acquire/release ordering; correct on all ISAs. Trade-off: More verbose than volatile, which the JIT compiles to the same fences — prefer plain volatile unless you specifically need explicit modes.

8. C++ mutex-every-call → call_once¶

Before

Singleton& get() { std::lock_guard<std::mutex> g(m); static Singleton* p; if(!p) p=new Singleton(); return *p; }

Problem: Locks the mutex on every call. After

Singleton& get() { std::call_once(flag, []{ p = new Singleton(); }); return *p; }

Why: call_once runs the init once with correct publication and a cheap fast path after. Trade-off: Still slightly heavier than a magic static (Opt 9).

9. C++ call_once → magic static¶

Before: std::call_once + once_flag + pointer. After

Singleton& get() { static Singleton s; return s; } // thread-safe since C++11

Why: The compiler emits a fast already-initialized check (an internal DCL) and handles ordering; simplest correct form, no manual flag/pointer, no heap leak. Trade-off: Init order across translation units is less explicit than call_once; fine for most cases.

10. Split flag+data → single volatile field¶

Before

private volatile boolean ready;
private Index index;            // plain

Problem: Correctness depends on read-ordering between two fields; brittle under refactor. After

private volatile Index index;  // DCL on index == null directly

Why: One field, one happens-before guarantee — fewer moving parts, harder to break. Trade-off: None; this is strictly simpler and safer.

Optimization Tips¶

The biggest "optimization" is usually replacing DCL with the holder idiom — it's simpler and has a plain-read fast path. Measure before assuming DCL is faster.
On the hot path, a plain read beats a volatile read on weak ISAs; prefer designs (holder, eager final) whose steady state is a plain read.
Always snapshot the volatile field into a local to avoid a second acquire load.
Don't optimize laziness you don't need — eager final is the cheapest correct thing when early init is acceptable.
Benchmark steady-state reads with JMH / Google Benchmark, on the target ISA, with the result blackholed — naive loops mislead.
In C++, prefer magic static; reach for atomics-DCL only when you must, and never use volatile for threading.

Double-Checked Locking — Optimize¶

Table of Contents¶

1. Lock-on-every-access → DCL¶

2. Two volatile reads → one local¶

3. DCL on a static → holder idiom¶

4. Hand-rolled singleton → enum¶

5. Volatile read on hot path → eager final¶

6. Lock on this → private lock object¶

7. Broken DCL → VarHandle acquire/release¶

8. C++ mutex-every-call → call_once¶

9. C++ call_once → magic static¶

10. Split flag+data → single volatile field¶

Optimization Tips¶

6. Lock on `this` → private lock object¶