The k-Server Problem — Senior Level¶
Prerequisites¶
- Middle Level — the problem definition (
kservers in a metric space, serve each request by moving a server, minimize total distance), the deterministic lower boundkon every metric with≥ k+1points, the Double-Coverage (DC) algorithmk-competitive on the line and on trees, and the statement of thek-server conjecture - Competitive Analysis — Senior — the adversary hierarchy, the potential method as the engine of competitive proofs, and Yao's principle as von Neumann minimax (reused for the randomized lower bounds)
- Paging and Caching — Senior — paging as the uniform-metric special case; the
H_krandomized optimum that the general randomized conjecture generalizes
Table of Contents¶
- What Senior-Level k-Server Theory Is About
- The Work Function: Definition and Properties
- The Work Function Algorithm (WFA)
- Koutsoupias–Papadimitriou: WFA Is (2k−1)-Competitive
- The k-Server Conjecture: Status
- Metrical Task Systems: the Strict Generalization
- The Randomized k-Server Conjecture and the HST Breakthroughs
- Special Metrics and Refined Results
- Learning-Augmented k-Server and MTS
- Worked Piece: Double-Coverage on Trees Is k-Competitive
- Decision Framework
- Research Pointers
- Key Takeaways
What Senior-Level k-Server Theory Is About¶
The middle level fixes the landscape. The k-server problem — k mobile servers in a metric space (M, d), each request a point of M that must be covered by moving some server there, minimizing total distance — has a clean deterministic lower bound of k on every metric with at least k+1 points, matched by Double-Coverage on the line and on trees. The k-server conjecture of Manasse, McGeoch, and Sleator (1988) asserts that a k-competitive deterministic algorithm exists on every metric. That conjecture is the organizing question of the whole field, and it is still open.
Senior-level k-server theory is the deep machinery built around that gap between "known lower bound k" and "best known general upper bound," plus the place of k-server inside the broader hierarchy of online metric problems:
- The Work Function Algorithm (WFA) and the
2k−1bound. Koutsoupias and Papadimitriou (1995) proved that a single, metric-oblivious algorithm — WFA, which at each request moves to minimize a "lookback-optimal cost plus travel" objective — is(2k−1)-competitive on every metric space. This is the best general bound known and the strongest evidence for the conjecture. The proof is a tour de force of potential-function design (the quasi-convexity of work functions, the extended-cost / pseudo-potential argument). - Metrical Task Systems (MTS) place
k-server in a hierarchy. Borodin, Linial, and Saks (1992) defined MTS, a strict generalization in which an online player moves amongnstates paying movement plus task-processing cost.k-server reduces to MTS on its configuration space. MTS has a tight deterministic ratio of2n−1and a randomized ratio ofΘ(log n / log log n)— both of which frame what is and is not possible fork-server. - The randomized
k-server conjecture and the HST revolution. The conjecture's randomized cousin asks for anO(\mathrm{polylog}\,k)-competitive randomized algorithm on every metric. A decade of work — embedding metrics into hierarchically separated trees (HSTs), solving a fractional relaxation by min-cost flow / mirror descent, then rounding — producedO(\log^2 k \log n)(Bansal–Buchbinder–Mądry–Naor 2011) and ultimatelyO(\log^6 k)polylog bounds (Bubeck–Cohen–Lee–Lee–Mądry 2018; Lee 2018). These are notk-competitive; they are polylogarithmic, and whether the cleanO(\log k)target is achievable on all metrics remains open.
The through-line: paging (uniform metric) collapses all geometry, leaving combinatorics; the general k-server problem is where the geometry fights back, and the tools — work functions, potential functions over server configurations, metric embeddings into trees — are the senior-level content. Everything in ../03-paging-and-caching-theory/senior.md is the trivial-metric shadow of what follows.
The Work Function: Definition and Properties¶
The central object is the work function. Fix a metric (M, d), k servers, and an initial configuration A₀ (a multiset of k points). Let σ = r₁, …, r_t be the requests seen so far.
Definition (work function). For any configuration
C(a multiset ofkpoints), the work function valuew_t(C)is the minimum total offline cost to server₁, …, r_tstarting fromA₀and ending with the servers exactly atC.
In words, w_t(C) is the cheapest way an omniscient offline player could have served the whole history and be parked at configuration C right now. It is computed by dynamic programming over configurations:
w_0(C) = d(A_0, C) # min-cost matching A_0 → C
w_t(C) = min over configs X containing r_t of
( w_{t-1}(X) + d(X, C) )
where d(X, C) denotes the minimum-cost perfect matching distance between two k-point configurations (the minimum total distance to move servers from X to C). The inner constraint "X contains r_t" enforces that request r_t was served — some server sat on r_t at time t.
Two structural properties make WFA analyzable.
1. Lipschitz / non-expansiveness. For any two configurations C, C':
Ending one matching-distance away changes the optimal cost by at most that distance. This is immediate: an offline schedule ending at C can be extended to end at C' by paying d(C, C') more.
2. Quasi-convexity. This is the deep property and the engine of the 2k−1 proof.
Lemma (quasi-convexity of work functions). For any configurations
X, Y, there is a bijection (a pairing of the points ofXwith the points ofY) such that, splitting along the pairing into "min" and "max" halvesX∧YandX∨Y,
Quasi-convexity says work functions behave like convex functions with respect to the lattice structure on configurations: you cannot decrease the sum by "swapping" servers between two configurations along the right pairing. It is what lets the analysis control how the minimizing configuration moves as requests arrive, and it is preserved under the work-function update — the crux lemma the entire competitive proof leans on.
The cost of all this power is computational: maintaining w_t exactly requires tracking the work-function value over all relevant configurations, of which there can be Θ(n^k) for an n-point metric. WFA is a theoretical instrument, not a production cache policy.
The Work Function Algorithm (WFA)¶
WFA is the online algorithm induced by the work function. Suppose WFA's servers are currently at configuration A_{t-1}, and request r_t arrives.
WFA rule. Move to the configuration
Equivalently: among configurations that serveCcontainingr_tthat minimizesr_t, pick the one that balances "looks cheap in hindsight" (w_t(C), the retrospective optimal cost ending there) against "is close to where I am" (d(A_{t-1}, C), my travel cost).
A standard simplification: it suffices to move a single server to r_t, choosing which server s (at point a_s) to send so as to minimize w_t(A_{t-1} − a_s + r_t) + d(a_s, r_t). WFA never moves more than the one server that serves the request.
The two terms encode the algorithm's philosophy:
w_t(C)is retrospection: it pulls WFA toward configurations that an offline optimum would have found cheap, i.e., toward "where the action has been." This is what makes WFA adaptive to the request structure — it is not a fixed greedy rule like DC; it reconstructs the optimal offline trajectory's endpoint and chases it.d(A_{t-1}, C)is inertia: it penalizes moving far, preventing WFA from teleporting servers around in response to retrospection alone.
The genius is that this single, metric-independent rule — no special-casing for lines, trees, or stars — achieves 2k−1 everywhere. Compare DC, which is k-competitive but only defined and analyzed on trees (where "move all servers adjacent to the request toward it" makes geometric sense). WFA needs no such structure.
For paging (uniform metric), WFA specializes to a marking-flavored policy and recovers k-competitiveness; the general bound 2k−1 is what you pay for arbitrary geometry.
Koutsoupias–Papadimitriou: WFA Is (2k−1)-Competitive¶
The landmark result is:
Theorem (Koutsoupias–Papadimitriou 1995). On every metric space, the Work Function Algorithm is
(2k − 1)-competitive.
Published as "On the k-Server Conjecture" (JACM 42(5), 1995), this remains — three decades on — the best known general upper bound for deterministic k-server. It does not prove the conjecture (k), but it is the same order of magnitude, and it holds with no metric assumptions whatsoever, which earlier k-competitive results (DC on trees, MARKING on uniform) could not claim.
The potential argument at a high level¶
The proof is a potential-function argument over work functions; here is the architecture without the full algebra.
Define a potential Φ_t that is a function of the entire current work function w_t, not just of WFA's configuration. The canonical choice is a "sum of extended / pseudo-costs":
— more precisely, Φ_t is built from the work-function values at a carefully chosen family of configurations (related to the current configuration by single-server displacements), summed so as to capture "how much retrospective slack the work function has accumulated." The actual definition uses the minimum over orderings of the servers of a telescoping sum of work-function values along single-server moves; quasi-convexity is what guarantees this minimum is well-behaved.
The proof then establishes two inequalities per request r_t, exactly in the potential-method shape ALG_move + ΔΦ ≤ c · OPT_move:
- The work function rises by at least OPT's cost. Because
w_t(C) ≥ w_{t-1}(C)for allC(more requests cannot lower the optimal cost), and the minimum of the work function tracks the offline optimum, the increase inmin_C w_t(C)over the sequence equalsOPT(σ). So summing the work-function increase recovers OPT's total cost. - WFA's move cost is paid by the potential drop plus a
(2k−1)share of the work-function rise. This is the quasi-convexity-driven step. When WFA moves a server to server_t, the cost of that move is shown — using the Lipschitz and quasi-convexity properties ofw_t— to be at most(2k−1)times the local increase inmin_C w_t(C), minus the change inΦ. The quasi-convexity lemma is invoked to bound how the minimizing configuration shifts, which is exactly what controls WFA's travel.
Telescoping the per-request inequality over σ, the ΔΦ terms cancel and the work-function increments sum to OPT(σ):
The number 2k − 1 emerges from the combinatorics of the potential: there are essentially k server positions whose displacements must be charged, and the "doubling" comes from charging both WFA's move and the worst-case shift of the minimizing configuration — 2k charges minus one because the served point is shared. The argument is delicate enough that simplifications and re-presentations (Bartal–Koutsoupias; Borodin–El-Yaniv's textbook treatment) remain valuable; the bottom line is that 2k−1 is what the quasi-convexity bookkeeping yields, and tightening it toward k for general metrics is precisely the open problem.
What WFA does not prove. WFA is not known to be
k-competitive in general — there are metrics where its competitive ratio is known to be strictly more thankfor smallkin the analysis, though no metric is known where any deterministic algorithm needs more thank. The conjectured truth isk;2k−1is the proven ceiling.
The k-Server Conjecture: Status¶
Conjecture (Manasse–McGeoch–Sleator 1988). On every metric space with at least
k+1points, there is a deterministic online algorithm that isk-competitive for thek-server problem.
The lower bound k (matching the conjecture's target) was proved by the same authors: on any metric with k+1 points, an adversary requesting the unique point with no server forces every deterministic algorithm to pay at least k times the offline optimum. So the conjecture is "the obvious lower bound is achievable." Its status, case by case:
| Metric / case | Best known | Status |
|---|---|---|
| Uniform (paging) | k (MARKING / LRU) | Proved |
| Line | k (Double-Coverage) | Proved |
| Trees | k (Double-Coverage, Chrobak–Larmore) | Proved |
k = 2, any metric | 2 (Manasse–McGeoch–Sleator) | Proved |
k = n − 1 points | k | Proved |
General metric, general k | 2k − 1 (WFA, Koutsoupias–Papadimitriou 1995) | OPEN |
So the conjecture is settled for every "nice" metric and for small k, and the general case is bracketed between the proven lower bound k and the proven upper bound 2k−1 of WFA. Closing that factor-of-≈2 gap for arbitrary metrics is one of the longest-standing open problems in online computation. WFA is the strongest candidate algorithm — most experts believe WFA itself is k-competitive on every metric, but no proof is known.
The randomized k-server conjecture is the modern frontier and is discussed below: it asks for O(\mathrm{polylog}\,k) (ideally O(\log k)) rather than k, and the breakthroughs of the 2010s answer it affirmatively at the polylog level while leaving the precise exponent — and O(\log k) itself — open.
Metrical Task Systems: the Strict Generalization¶
To place k-server in context, step up to Metrical Task Systems (MTS), introduced by Borodin, Linial, and Saks, "An Optimal On-line Algorithm for Metrical Task Systems" (JACM 39(4), 1992).
Definition (MTS). There are
nstates forming a metric space(S, d). The online player occupies a state. At each step a task arrives, specified by a vector of processing costsc(s) ≥ 0, one per state. The player may first move from its current statesto a states'(payingd(s, s')), then process the task there (payingc(s')). Total cost is movement plus processing, summed over all tasks; minimize it.
MTS is strictly more general than k-server: a k-server instance on metric M becomes an MTS whose states are the O(\binom{|M|}{k}) server configurations, with d the matching distance between configurations, and each request r encoded as a task charging 0 to configurations containing r and ∞ otherwise (forcing the server to be present). So k-server reduces to MTS on its configuration space, and any MTS competitive bound applies — but the MTS state count n is exponential in k, so MTS bounds in terms of n are far weaker than the k-server-specific 2k−1.
The MTS results that frame the hierarchy:
Theorem (Borodin–Linial–Saks 1992). The deterministic competitive ratio of MTS on
nstates is exactly2n − 1, achieved by a work-function algorithm and matched by a lower bound on the uniform metric. For randomized MTS against the oblivious adversary the ratio isΘ(\log n / \log\log n)(upper bound via tree embeddings; lower bound by Bartal–Bollobás–Mendel and Bartal–Linial–Mendel–Naor).
Two senior takeaways:
- The MTS work-function algorithm is the direct ancestor of WFA —
k-server's WFA is the MTS algorithm specialized to the configuration-space MTS, and the2n−1MTS bound is the philosophical parent of2k−1. - The tight randomized MTS ratio
Θ(\log n / \log\log n)is what makes the randomizedk-server bounds plausible: solvingk-server through MTS on HSTs (next section) inherits MTS's polylog randomized power, withncontrolled by the metric's HST representation rather than the naive configuration count.
Hierarchy of online metric problems (increasing generality):
Paging ⊂ k-server (general metric) ⊂ Metrical Task Systems
(uniform (WFA: 2k−1 det; (BLS: 2n−1 det, tight;
metric; O(polylog k) rand) Θ(log n / loglog n) rand)
k det;
H_k rand)
The Randomized k-Server Conjecture and the HST Breakthroughs¶
Randomized
k-server conjecture. On every metric, there is a randomized online algorithm that isO(\mathrm{polylog}\,k)-competitive against the oblivious adversary (conjecturallyO(\log k)).
For uniform metrics this is settled and tight at H_k = Θ(\log k) (paging — see ../03-paging-and-caching-theory/senior.md). For general metrics it was open for two decades. The resolution at the polylog level came through a uniform strategy: embed the metric into a hierarchically separated tree (HST), solve a fractional relaxation there, and round.
The HST + fractional / MTS pipeline¶
The recipe that powered the breakthroughs:
- Metric embedding. By Fakcharoenphol–Rao–Talwar (FRT 2003), any
n-point metric embeds into a distribution over HSTs with expected distortionO(\log n). Solvingk-server on HSTs (and paying the distortion) suffices. - Fractional / convex relaxation. On an HST, the
k-server problem is relaxed to a fractional problem — server mass distributed continuously over tree leaves — which becomes a min-cost flow / allocation or a metrical task system on the tree. This convex problem is solved online by primal-dual or mirror-descent / regularization methods (entropic regularization on the tree's allocation polytope). - Rounding. The fractional online solution is rounded to an integral randomized server movement online, losing only logarithmic factors.
What was proved, and by whom¶
Theorem (Bansal–Buchbinder–Mądry–Naor 2011/2015). There is a randomized algorithm that is
O(\log^2 k \, \log^3 n)-competitive (commonly cited asO(\log^2 k \log n)) fork-server on anyn-point metric — the first\mathrm{polylog}-competitive randomized algorithm for general metrics.
Bansal, Buchbinder, Mądry, and Naor, "A Polylogarithmic-Competitive Algorithm for the k-Server Problem" (FOCS 2011; JACM 2015), built the algorithm via an online fractional solver on HSTs (min-cost-flow / primal-dual) plus rounding. This broke the long-standing barrier — before it, no general-metric randomized bound beating the deterministic 2k−1 was known for large k.
Theorem (Bubeck–Cohen–Lee–Lee–Mądry 2018; Lee 2018). There is an
O(\log^6 k)-competitive randomized algorithm fork-server on HSTs (henceO(\log^6 k \cdot \log n)or\mathrm{polylog}on general metrics after embedding), with the dependence onkalone made polylogarithmic.
Bubeck, Cohen, Lee, Lee, and Mądry, "k-server via multiscale entropic regularization" (STOC 2018), and James Lee, "Fusible HSTs and the randomized k-server conjecture" (FOCS 2018), used multiscale entropic regularization (a mirror-descent view on the HST's recursive structure) to drive the dependence on the number of servers down to polylog in k — closer to the conjectured O(\log k). Lee's fusible-HST construction supplies the metric-side ingredient that lets the HST embedding cooperate with the entropic solver.
Where it stands¶
- These results confirm the randomized conjecture at the polylog level:
O(\mathrm{polylog}\,k)is achievable on every metric. - They are emphatically not
k-competitive and not even claimed to be — they are polylogarithmic, an exponentially smaller ratio than the deterministic2k−1whenkis large. - The exact optimal exponent is open: whether
O(\log k)(matching the uniform-metricH_k) is achievable on all metrics is the remaining target. The mirror-descent / regularization viewpoint, which unifies these algorithms with online learning (the regret framework in../01-competitive-analysis/senior.md), is the leading approach.
Special Metrics and Refined Results¶
Between "uniform" (paging) and "fully general," several structured metrics have sharper, named results.
Weighted paging / weighted caching¶
Pages have non-uniform fetch costs — the metric is a weighted star (center = "out of cache," leaves = pages, leaf-to-center distance = fetch cost). This is harder than uniform paging but still tractable:
Weighted paging is
k-competitive deterministically (Chrobak–Karloff–Payne–Vishwanathan for the structure; the cleank-competitive bound is often attributed via the primal-dual method). Young'sk-competitive landlord / GREEDYDUAL generalizes LRU to weighted caching. The randomized weighted-paging ratio isO(\log k), proved by the primal-dual method of Bansal, Buchbinder, and Naor, "A Primal-Dual Randomized Algorithm for Weighted Paging" (FOCS 2007; JACM 2012).
The primal-dual / online-LP technique developed for weighted paging is the methodological seed of the general-metric randomized results above — weighted paging is the simplest non-uniform metric where fractional-then-round on a star already needs the LP machinery.
Resource-augmented (h, k)-server¶
By analogy with paging's resource augmentation, the (h, k)-server problem compares an online algorithm with k servers against an offline optimum with only h ≤ k servers. Extra servers tame the ratio: WFA and related algorithms achieve ratios that improve as k/h grows, and resource augmentation is a standard lens for explaining why server systems with slack behave far better than the worst-case k suggests. The fractional bounds above also have (h,k) refinements where the competitive ratio degrades gracefully as h → k.
Generalized / weighted k-server — genuinely harder¶
The generalized k-server problem (each server lives in its own metric space, and a request is a tuple specifying a target for each server, served when one server reaches its coordinate) and the closely related weighted k-server (servers have different per-unit movement weights) are dramatically harder:
Exponential lower bounds are known. For weighted
k-server even on the uniform metric, the deterministic competitive ratio is at least2^{Ω(k)}(Fiat–Ricklin; Bansal–Eliáš–Koumoutsos and others), and for generalizedk-server on general metrics the achievable ratios are exponential ink. The cleank/2k−1story collapses entirely.
These results are the cautionary boundary of the field: the moment servers are heterogeneous, the problem leaves the polynomial-competitive regime, and intuition from paging and standard k-server fails. They are why "weighted caching is fine but weighted k-server is a monster" is a senior rule of thumb.
| Variant | Best known | Note |
|---|---|---|
Uniform k-server (paging) | k det / H_k rand | tight |
| Weighted paging (weighted star) | k det / O(\log k) rand (primal-dual) | tractable |
General-metric k-server | 2k−1 det (WFA) / O(\mathrm{polylog}\,k) rand | conjecture open at k / O(\log k) |
(h,k)-server | improves with k/h | resource augmentation |
Weighted k-server | 2^{Ω(k)} lower bound | exponential — escapes the polynomial regime |
Generalized k-server | exponential in k | hardest variant |
Learning-Augmented k-Server and MTS¶
The algorithms-with-predictions paradigm — founded on caching (Lykouris–Vassilvitskii; see ../03-paging-and-caching-theory/senior.md) — extends to general k-server and MTS.
The natural prediction is the offline servers' trajectory: at each step, an ML oracle predicts where the k offline-optimal servers should be (their configuration). An algorithm that blindly chases the predicted configuration is consistent — it matches OPT when predictions are perfect — but not robust, since a confidently wrong prediction sends servers chasing a phantom optimum at unbounded cost.
Result (Antoniadis, Coester, Eliáš, Polak, Simon 2020). "Online Metric Algorithms with Untrusted Predictions" (ICML 2020) gives, for MTS and
k-server, a combiner that takes a prediction-following algorithm and a robust online algorithmRand produces, for a confidence parameter, a policy that is(1 + ε)-consistent (near-OPT when predictions are good) andO(ρ_R / ε)-robust (within a constant of the best worst-case ratioρ_Rwhen predictions are adversarial). The template is the general-metric analogue of the marking-backbone combiner for caching.
The mechanism is the standard consistency–robustness combiner: run the predictor-following policy but bound, via a robust shadow algorithm (e.g. WFA or a randomized MTS algorithm), how far the realized cost can drift from the worst-case guarantee. Follow-ups refine the prediction model (predicting the next request rather than the whole offline configuration) and tighten the trade-off. The senior framing matches caching exactly: k-server-with-predictions interpolates between OPT-when-trusted and the classical competitive ratio when-not, with no metric assumptions beyond those the underlying robust algorithm needs.
Worked Piece: Double-Coverage on Trees Is k-Competitive¶
To ground the abstract machinery, here is a full competitive proof on a structured metric: Double-Coverage (DC) is k-competitive on any tree (Chrobak–Larmore 1991), the result the line case specializes. It is the cleanest k-competitive k-server proof and the geometric counterpoint to WFA's metric-obliviousness.
The DC algorithm on a tree¶
A request arrives at a tree point r. Call a server active if it is adjacent to r in the following sense: no other server lies strictly between it and r on the tree (i.e., it can "see" r along the tree without another server blocking the path). DC's move:
Double-Coverage. Move all active servers toward
rsimultaneously, at equal unit speed, until the first one reachesr. Then stop (one server now coversr).
On a path (the line), at most two servers are active — the nearest on the left and the nearest on the right — and they advance toward r together until one arrives, the other having moved an equal distance. On a general tree the active set can be larger (one per subtree direction around r), but the rule is the same: all unblocked servers converge on r at equal speed.
The potential¶
Run DC and OPT in lockstep on the same requests. Let A be DC's configuration and B be OPT's. The potential combines two terms:
whereM_min(A, B)is the minimum-cost matching between DC's servers and OPT's servers, andΣ_{i<j} d(a_i, a_j)is the sum of pairwise distances among DC's own servers.
The first term measures DC's mismatch against OPT (scaled by k); the second is an internal "spread" term that pays for DC's servers moving toward each other. Φ ≥ 0 always.
The step inequality¶
Each request is analyzed in two phases — OPT moves, then DC moves — exactly the potential-method split.
Phase 1 — OPT moves a server to r (cost d_OPT). Only B changes. The matching M_min(A, B) can increase by at most d_OPT (moving one of OPT's servers by d_OPT changes the matching cost by at most that), and the DC-internal term is unchanged. So:
Phase 2 — DC moves its active servers toward r (cost d_DC). This is where the tree geometry pays off. Two sub-cases:
- Exactly one active server moves (e.g., it is the only one that can see
r): it moves distanced_DCtowardr. Becauseris now covered and OPT also has a server atr, this server moves toward its matched OPT server, so the matching termk · M_mindecreases byk · d_DC. The internal spread term changes by at most(k−1) · d_DC(the moving server changes distance to the otherk−1). Net:ΔΦ ≤ −k·d_DC + (k−1)·d_DC = −d_DC. Thend_DC + ΔΦ ≤ d_DC − d_DC = 0 ≤ k·d_OPT. - Two or more active servers move (the typical line case: left and right server each move
δ, sod_DC = 2δcounting both, but only one reachesr): the server that reachesrdecreases the matching term as above; the other active server moving towardris moving toward the cluster, so the internal spread term decreases enough to pay for its motion. The key tree fact — active servers move toward each other throughr's neighborhood — makes the pairwise-distance term drop by exactly enough to cancel the extra movement. The careful accounting yields, again,
The geometric heart is that DC's servers always move toward each other (they converge on r), so the internal spread term Σ d(a_i, a_j) is monotonically decreasing under DC's own moves, funding the motion of the servers that are not the one matched to r.
Summing¶
Combine the two phases per request: d_DC + ΔΦ ≤ k · d_OPT. Telescoping over the whole sequence (Φ starts finite, stays ≥ 0):
DC is k-competitive on trees — matching the lower bound k, hence optimal for trees. The proof is the archetype the conjecture wants to generalize: a matching term (mismatch vs OPT) plus an internal-geometry term (spread of ALG's own servers), with the geometry of the move guaranteeing the internal term funds the parts of the move the matching term cannot. WFA's 2k−1 proof is the metric-oblivious, work-function-based replacement for exactly this geometric argument — and the factor 2k−1 vs k is the price of giving up the tree structure that makes DC's spread term work.
Decision Framework¶
| Situation | Reach for |
|---|---|
| General metric, deterministic, want a provable bound | WFA — (2k−1)-competitive everywhere (Koutsoupias–Papadimitriou); the only metric-oblivious option |
| Tree or line metric | Double-Coverage — k-competitive (optimal), simple and geometric, no work-function machinery |
| Uniform metric (paging) | MARKING / LRU — k det, H_k rand; see ../03-paging-and-caching-theory/senior.md |
| Non-uniform fetch cost (weighted caching) | GREEDYDUAL / Landlord (Young) k det; primal-dual O(\log k) rand (Bansal–Buchbinder–Naor) |
Want O(\mathrm{polylog}\,k) randomized on a general metric | HST embedding + fractional/MTS solver + rounding (Bansal–Buchbinder–Mądry–Naor; Bubeck et al.) |
Servers heterogeneous (weighted / generalized k-server) | Expect exponential ratios — the polynomial-competitive theory does not apply |
| Online algorithm has more servers than the baseline | (h, k)-server resource augmentation — ratio improves with k/h |
| You have an ML oracle for the offline trajectory | Learning-augmented k-server/MTS (Antoniadis et al.) — consistency + robustness combiner |
Want to place k-server in the broader theory | Reduce to MTS on configurations (Borodin–Linial–Saks); 2n−1 det, Θ(\log n/\log\log n) rand |
Research Pointers¶
- Manasse, M., McGeoch, L. A., & Sleator, D. D. (1990). "Competitive Algorithms for Server Problems." J. Algorithms 11(2) (STOC 1988). Defined the
k-server problem, the lower boundk, the conjecture, and thek=2upper bound. - Koutsoupias, E., & Papadimitriou, C. H. (1995). "On the k-Server Conjecture." JACM 42(5). The Work Function Algorithm and the
(2k−1)-competitive theorem — the best general bound known. - Chrobak, M., & Larmore, L. L. (1991). "An Optimal On-line Algorithm for k Servers on Trees." SIAM J. Computing 20(1). Double-Coverage
k-competitive on trees (the worked proof above). - Borodin, A., Linial, N., & Saks, M. (1992). "An Optimal On-line Algorithm for Metrical Task Systems." JACM 39(4). MTS, the
2n−1deterministic tight ratio, and the work-function method that fathered WFA. - Bansal, N., Buchbinder, N., Mądry, A., & Naor, J. (2015). "A Polylogarithmic-Competitive Algorithm for the k-Server Problem." JACM 62(5) (FOCS 2011). First
\mathrm{polylog}-competitive randomizedk-server via fractional HST solving and rounding. - Bubeck, S., Cohen, M. B., Lee, Y. T., Lee, J. R., & Mądry, A. (2018). "k-server via Multiscale Entropic Regularization." STOC.
O(\log^6 k)on HSTs via mirror descent / entropic regularization. - Lee, J. R. (2018). "Fusible HSTs and the Randomized k-Server Conjecture." FOCS. The metric-embedding ingredient delivering polylog-in-
krandomized competitiveness. - Bansal, N., Buchbinder, N., & Naor, J. (2012). "A Primal-Dual Randomized Algorithm for Weighted Paging." JACM 59(4) (FOCS 2007).
O(\log k)randomized weighted paging; the online-LP seed for the general results. - Young, N. E. (1994/2002). "On-line File Caching" / "The k-Server Dual and Loose Competitiveness for Paging." GREEDYDUAL / Landlord —
k-competitive weighted caching. - Bansal, N., Eliáš, M., & Koumoutsos, G. (2017). "Weighted k-Server Bounds via Combinatorial Dichotomies." FOCS. Exponential lower bounds for weighted
k-server. - Antoniadis, A., Coester, C., Eliáš, M., Polak, A., & Simon, B. (2020). "Online Metric Algorithms with Untrusted Predictions." ICML. Learning-augmented
k-server and MTS: consistency + robustness. - Borodin, A., & El-Yaniv, R. (1998). Online Computation and Competitive Analysis. Cambridge Univ. Press. The definitive treatment of
k-server, work functions, DC, and MTS.
Key Takeaways¶
- The Work Function Algorithm is
(2k−1)-competitive on every metric (Koutsoupias–Papadimitriou 1995) — the best known general deterministic bound. WFA servesr_tby moving to the configurationC ∋ r_tminimizingw_t(C) + d(A_{t-1}, C), balancing retrospective optimality against travel inertia. The proof is a potential argument driven by the quasi-convexity and Lipschitz properties of work functions. - The
k-server conjecture (Manasse–McGeoch–Sleator 1988) is still open. Ak-competitive deterministic algorithm is proved for uniform, line, tree metrics and fork=2; for general metrics the bracket is lower boundk≤ truth ≤2k−1(WFA). Closing it is a marquee open problem; WFA is the leading candidate. k-server reduces to Metrical Task Systems (Borodin–Linial–Saks 1992) on its exponential configuration space. MTS has a tight deterministic ratio2n−1(the work-function method, parent of WFA) and randomizedΘ(\log n/\log\log n); this hierarchy frames whatk-server can hope for.- The randomized
k-server conjecture is settled at the polylog level.O(\log^2 k \log n)(Bansal–Buchbinder–Mądry–Naor 2011) andO(\log^6 k)(Bubeck–Cohen–Lee–Lee–Mądry 2018; Lee 2018) come from embedding into HSTs, solving a fractional/MTS relaxation by min-cost flow or entropic mirror descent, then rounding. These are polylog, notk-competitive; the exactO(\log k)target is open. - Special metrics have sharper results. Weighted paging is
kdet /O(\log k)rand (primal-dual, Bansal–Buchbinder–Naor);(h,k)-resource-augmentation softens the ratio; but weighted/generalizedk-server has exponential2^{Ω(k)}lower bounds — heterogeneous servers escape the polynomial-competitive regime entirely. - Learning-augmented
k-server/MTS (Antoniadis et al. 2020) predicts the offline servers' trajectory and combines a prediction-follower with a robust algorithm to get consistency (near-OPT when trusted) and robustness (classical ratio when not) — the general-metric analogue of learning-augmented caching. - Double-Coverage is
k-competitive (optimal) on trees via a matching term plus an internal-spread term whose monotone decrease funds the convergent moves — the geometric archetype that the metric-oblivious WFA replaces, at the cost of2k−1instead ofk.
See also:
./middle.mdfor the definition, the lower boundk, DC on the line, and the conjecture statement ·./professional.mdfor engineering and applied server-movement systems ·../03-paging-and-caching-theory/senior.mdfor the uniform-metric special case andH_k·../01-competitive-analysis/senior.mdfor the potential method, the adversary hierarchy, and Yao's principle
In this topic
- junior
- middle
- senior
- professional