Online Truthful Mechanisms for Multi-sided Markets

Online Truthful Mechanisms for Multi-sided Markets

Our Mechanism

In this section we describe our online mechanism “Observe and Price Mechanism” (). assumes $`|S_c(P, B)| > 0`$, and that there exists a value $`\alpha \in [|S_c(P, B)|^{-1}, 1]`$, known to the mechanism, such that we are guaranteed that:

\frac{u(a)}{|S_c(P, B)|} \leq   \alpha \quad \forall\; a \in A
    \qquad \text{and}   \qquad
    \frac{|P(m)|}{|S_c(P, B)|} \leq \alpha \quad \forall\; m \in M
    \enspace.

In other words, $`\alpha`$ is an upper bound on how large can the capacity of an advertiser or the number of users of a mediator be compared to the size of the optimal assignment $`S_c(P, B)`$. We remind the reader that $`\alpha`$ can be informally understood as a bound on the market importance of every single advertiser or mediator.

A description of is given as Mechanism [mch:OPM]. Notice that Mechanism [mch:OPM] accepts a parameter $`r \in (0, \nicefrac{1}{2}]`$ whose value is specified later. Additionally, Mechanism [mch:OPM] often refers to parameters of the model that are not known to the mechanism, such as the value of an advertiser or the number of users of a mediator. Whenever this happens, this should be understood as referring to the reported values of these parameters.



Draw a random value $`t`$ from the binomial distribution $`\cB(|A| + |M|, r)`$, and observe the first $`t`$ entities that arrive without assigning any users. Let $`A_T`$ and $`M_T`$ be the set of the observed advertisers and mediators, respectively. More formally, if $`T`$ is the set of the first $`t`$ entities that arrived, then $`A_T = A \cap T`$ and $`M_T = M \cap T`$. We later refer to this step of the mechanism as the “observation phase”.

Let $`\hat{p}`$ and $`\hat{b}`$ be the user and slot at location $`\lceil (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| \rceil`$ of the canonical assignment $`S_c(P(M_T), B(A_T))`$. If $`(1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| \leq 0`$, then the previous definition of $`\hat{p}`$ and $`\hat{b}`$ cannot be used. Instead define $`\hat{p}`$ as a dummy user of cost $`-\infty`$ and $`\hat{b}`$ as a dummy slot of value $`\infty`$. We say that a slot $`b`$ or a user $`p`$ corresponding to an entity that arrived after the observation phase is assignable if $`v(b) > v(\hat{b})`$ or $`c(p) < c(\hat{p})`$, respectively.

Let $`\sigma_E`$ be the sequence of the entities that arrived so far after the observation phase. Initially $`\sigma_E`$ is empty, and entities are added to it as they arrive.

For every arriving entity:

Add the new entity to the end of $`\sigma_E`$.

If the arriving entity is a mediator $`m`$, then, as long as $`m`$ has unassigned assignable users and there is an advertiser in $`\sigma_E`$ having unassigned assignable slots, do the following:

Let $`a`$ be the earliest advertiser in $`\sigma_E`$ having unassigned assignable slots.

Assign the unassigned assignable user of $`m`$ with the lowest cost to an arbitrary unassigned assignable slot of $`a`$, charge an amount of $`v(\hat{b})`$ from advertiser $`a`$ and pay $`c(\hat{p})`$ to mediator $`m`$.

If the arriving entity is an advertiser $`a`$, then, as long as $`a`$ has unassigned assignable slots and there is a mediator in $`\sigma_E`$ having unassigned assignable users, do the following:

Let $`m`$ be the earliest mediator in $`\sigma_E`$ having unassigned assignable users.

Assign the unassigned assignable user of $`m`$ with the lowest cost to an arbitrary assignable slot of $`a`$, charge an amount of $`v(\hat{b})`$ from advertiser $`a`$ and pay $`c(\hat{p})`$ to mediator $`m`$.

For every mediator $`m \in \sigma_E`$, recommend $`m`$ to transfer his assigned users an additional amount that guarantees the following:

If all the assignable users of $`m`$ are assigned to slots, then the additional amount should increase the total payment received so far by each assigned user of $`m`$ to $`c(\hat{p})`$.

Otherwise, let $`p`$ be the unassigned assignable user of $`m`$ with the minimum cost. In this case the additional amount should increase the total payment received so far by each assigned user of $`m`$ to $`c(p)`$.


We would like to note that is based on a mechanism of  named “Threshold by Partition Mechanism”, and the analyses of both mechanisms go along similar lines. However, introduces additional ideas that allow it to work in an online setting. In particular, uses an involved recommended payments updating rule that keeps it three-sided continuously individually rational. Moreover, is able to use an observation phase whose size is a small fraction of the entire input (for $`\alpha \ll 1`$), whereas the analysis of the original mechanism of  relies on the symmetry properties induced by an even partition of the input (which is inappropriate in an online setting).

Let us start the analysis of with the following simple observation, which shows that obeys the restriction of our model on the way a mechanism may update its assignment.

Each time assigns a user to a slot, either the user belongs to the newly arrived mediator or the slot belongs to the newly arrived advertiser.

Our objective in the rest of this section is to prove Theorem [thm:OPM]. In fact, we prove the following restatement of the theorem, which implies the original statement of the theorem from Section 6.1.

thm:OPM is budget balanced, three-sided continuously individually rational, three-sided incentive compatible and $`(1 - r - 22r^{-1} \cdot \sqrt[3]{\alpha} - 10e^{-2/\sqrt[3]{\alpha}})`$-competitive. Hence, for $`r = \min\{1/2, 4\sqrt[6]{\alpha}\}`$ the competitive ratio of is at least: $`1 - 9.5\sqrt[6]{\alpha} - 10e^{-2/\sqrt[3]{\alpha}}`$.

One part of Theorem [thm:OPM] (, that is budget balanced) is proved by the following observation.

is budget balanced.

Proof. We show that whenever assigns a user $`p`$ to a slot $`b`$, it charges the advertiser of $`b`$ more than it pays the mediator of $`p`$. Consider an arbitrary ordered pair $`(p, b)`$ from the assignment produced by . The fact that $`p`$ is assigned implies that $`c(p) < c(\hat{p})`$, and thus, $`\hat{p}`$ is not a dummy user (since $`c(\hat{p}) = -\infty`$ when $`\hat{p}`$ is a dummy user). Similarly, the fact that a user is assigned to $`b`$ implies that $`v(b) > v(\hat{b})`$, and thus, $`\hat{b}`$ is not a dummy slot (since $`v(\hat{b}) = \infty`$ when $`\hat{b}`$ is a dummy slot).

Recall that the fact that $`\hat{p}`$ and $`\hat{b}`$ are not dummy user and slot, respectively, implies that $`\hat{p}`$ and $`\hat{b}`$ are matched by the canonical assignment $`S_c(P(M_T), B(A_T))`$. Since a canonical assignment never assigns a user $`p'`$ to a slot $`b'`$ when $`c(p') > v(b')`$, we get: $`c(\hat{p}) < v(\hat{b})`$. The proof now completes by observing that the advertiser of $`b`$ is charged $`v(\hat{b})`$ for the assignment of $`p`$ to $`b`$, and the mediator of $`p`$ is paid only $`c(\hat{p})`$ for the assignment of $`p`$ to $`b`$. ◻

Following is a useful observation about that we occasionally use in the next proofs.

preserves the invariant that one of the following is always true immediately after processes the arrival of an entity:

assigned all the assignable users of mediators that have already arrived.

assigned users to all the assignable slots of advertisers that have already arrived.

Proof. Clearly the invariant holds during the observation phase because only mediators and advertisers that arrive after the observation phase contribute assignable users and slots, respectively. Next, assume the invariant held before the arrival of some mediator $`m`$ which arrives after the observation phase, and let us prove that it holds also after the arrival of $`m`$. If before the arrival of $`m`$ case [case:2] of the invariant held, then this case also holds after the arrival of $`m`$ since $`m`$ contributes no new slots. On the other hand, if case [case:1] held before the arrival of $`m`$, then assigns the assignable users of $`m`$ to assignable slots of advertisers that have already arrived till one of two things happen: either all the assignable slots of advertisers that have already arrived get assigned (and thus, case [case:2] of the invariant now holds), or all the assignable users of $`m`$ get assigned (and thus, case [case:1] of the invariant holds again). It remains to prove that if the invariant held before the arrival of an advertiser $`a`$ which arrives after the observation phase, then it also holds after her arrival. However, this proof is analogous to the above proof for mediators, and thus, we omit it. ◻

The Incentive Properties of

In this section we prove the incentive parts of Theorem [thm:OPM]. Specifically, we three lemmata showing that is three-sided continuously individually rational and three-sided incentive compatible. The first lemma analyzes the incentive properties of for users.

For every user $`p`$, assuming the mediator $`m`$ of $`p`$ is truthful, is continuously individually rational for $`p`$, and truthfulness is a dominant strategy for her.

The next lemma analyzes the incentive properties of for mediators.

For every mediator $`m`$, assuming the users of $`m`$ are truthful, is continuously individually rational for $`m`$, and truthfulness is a dominant strategy for him.

Finally, the next lemma considers the incentive properties of for advertisers. The proof of this lemma is analogous to the proof of the previous lemma (with slots exchanging roles with users, $`v(\hat{b})`$ exchanging roles with $`c(\hat{p})`$, etc.), and thus, we omit it.

For every advertiser $`a`$, is continuously individually rational for $`a`$, and truthfulness is a dominant strategy for her.

The Competitive Ratio of

In this section we analyze the competitive ratio of . Recall that $`\tau`$ was defined as a shorthand for $`|S_c(P, B)|`$. We now define $`\tilde{P}`$ ($`\tilde{B}`$) as the set of the users (slots) at locations $`1`$ to $`\lceil (1 - 6r^{-1} \cdot \sqrt[3]{\alpha})\tau \rceil`$ of the canonical assignment $`S_c(P, B)`$ ($`\tilde{P}`$ and $`\tilde{B}`$ are defined to be empty when $`1 - 6r^{-1} \cdot \sqrt[3]{\alpha} \leq 0`$). The following observation shows that most of the gain from trade of the canonical assignment $`S_c(P, B)`$ comes from the users and slots of $`\tilde{P}`$ and $`\tilde{B}`$, respectively. For convenience, let us denote by $`P_o`$ the set of users that are assigned by $`S_c(P, B)`$, and by $`B_o`$ the set of slots that are assigned some user by $`S_c(P, B)`$.

$`\sum_{b \in \tilde{B}} v(b) - \sum_{p \in \tilde{P}} c(p) \geq (1 - 6r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \GfT(S_c(P, B))`$.

Observation [obs:tilde_elements] shows that one can prove a competitive ratio for by relating the gain from trade of the assignment it produces to the gain from trade obtained by assigning the users of $`\tilde{P}`$ to the slots $`\tilde{B}`$. The following lemma is a key lemma we use to relate the two gains. In order to state this lemma we need some additional definitions. Consider the following two sets.

\hat{P} = \{p \in P(M \setminus M_T) \mid c(p) < c(\hat{p})\}
    \qquad \text{and} \qquad
    \hat{B} = \{b \in B(A \setminus A_T) \mid v(b) > v(\hat{b})\}
    \enspace.

Intuitively, $`\hat{P}`$ is the set of the assignable users, and $`\hat{B}`$ is the set of the assignable slots. It is important to note that $`\hat{P}`$ and $`\hat{B}`$ are both empty whenever $`\hat{p}`$ and $`\hat{b}`$ are dummy user and slot, respectively. We also define two additional sets $`A_L`$ and $`M_L`$ as follows. Let $`f`$ be a random variable distributed according to the binomial distribution $`\cB(|A \setminus A_T| + |M \setminus M_T|, \min\{16r^{-1} \cdot \sqrt[3]{\alpha}, 1\})`$, and let $`L`$ be the set of the last $`f`$ entities in $`\sigma_E`$ (or equivalently, the last $`f`$ entities to arrive). The sets $`A_L`$ and $`M_L`$ are then defined as $`A_L = A \cap L`$ and $`M_L = M \cap L`$.

There exists an event $`\cE`$ of probability at least $`1 - 10e^{-2/\sqrt[3]{\alpha}}`$ such that $`\cE`$ implies the following:

(i)  \ B(AT) ⊆  (iii) | \ P(ML)| ≤ ||
(ii)  \ P(MT) ⊆  (iv) | \ B(AL)| ≤ ||
(v) c(p) ≤ (P, B) ≤ v(b) for every user p ∈  and slot b ∈ , where (P, B) is a value which is independent of the random coins of and obeys c(p) ≤ (P, B) ≤ v(b) for every p ∈ Po and b ∈ Bo.

The proof of Lemma [lem:event_summary] is very similar to the proof of Lemma 4.6 in , and thus, we defer it to Appendix 9. In the rest of this section we explain how the competitive ratio of follows from Lemma [lem:event_summary]. Let $`\hat{S}`$ be the assignment produced by .

The event $`\cE`$ implies the following inequality:

\GfT(\hat{S})
    \geq
    \sum_{\substack{b \in \tilde{B} \\ b \not \in B(A_T \cup A_L)}} \mspace{-18mu}[v(b) - \ell(P, B)] + \sum_{\substack{p \in \tilde{P} \\ p \not \in P(M_T \cup M_L)}} \mspace{-18mu}[\ell(P, B) - c(p)]
    \enspace.

Proof. Lemma [lem:event_summary] shows that given $`\cE`$ we have $`|\hat{P} \setminus P(M_L)| \leq |\hat{B}|`$, hence, Observation [obs:invariant] implies that assigns at least $`|\hat{P} \setminus P(M_L)|`$ users. Additionally, since assigns users of mediators from $`M_L`$ only after all the assignable users of mediators from $`M \setminus (M_T \cup M_L)`$ are assigned to slots we get that all the users of $`\hat{P} \setminus P(M_L)`$ are assigned by $`\hat{S}`$ given $`\cE`$. On the other hand, Lemma [lem:event_summary] also shows that given $`\cE`$ all the users of $`\tilde{P} \setminus P(M_T)`$ belong to $`\hat{P}`$, and thus, the users of $`\tilde{P} \setminus P(M_T \cup M_L)`$ are all assigned by $`\hat{S}`$. A similar argument shows that the slots of $`\tilde{B} \setminus B(A_T \cup A_L)`$ are all assigned users by $`\hat{S}`$ given $`\cE`$. Finally, observe that $`\cE`$ also implies that $`c(p) \leq \ell(P, B) \leq v(b)`$ for every pair $`(p, b) \in \hat{S} \subseteq \hat{P} \times \hat{B}`$.

In the rest of the proof we assume that $`\cE`$ happens. Consider an ordered pair $`(p, b) \in \hat{S}`$. Then, the contribution of $`(p, b)`$ to $`\GfT(\hat{S})`$ is:

v(b) - c(p)
    =
    [v(b) - \ell(P, B)] + [\ell(P, B) - c(p)]
    \enspace.

By the above discussion, the two terms that appear in brackets on the right hand side of the last equation are both positive. This allows us to lower bound the gain from trade of $`\hat{S}`$ as follows:

\begin{align*}
    \GfT(\hat{S})
    ={} &
    \sum_{(p, b) \in \hat{S}} [v(b) - c(p)]
    =
    \sum_{(p, b) \in \hat{S}} \{[v(b) - \ell(P, B)] + [\ell(P, B) - c(p)]\}\\
    \geq{} &
    \sum_{\substack{b \in \tilde{B} \\ b \not \in B(A_T \cup A_L)}} \mspace{-18mu}[v(b) - \ell(P, B)] + \sum_{\substack{p \in \tilde{P} \\ p \not \in P(M_T \cup M_L)}} \mspace{-18mu}[\ell(P, B) - c(p)]
    \enspace.
    \qedhere
\end{align*}

 ◻

is at least $`(1 - r - 22r^{-1} \cdot \sqrt[3]{\alpha} - 10e^{-2/\sqrt[3]{\alpha}})`$-competitive.

Proof. The corollary is trivial when $`r + 22r^{-1} \cdot \sqrt[3]{\alpha} + 10e^{-2/\sqrt[3]{\alpha}} > 1`$. Thus, we assume in this proof $`r + 22r^{-1} \cdot \sqrt[3]{\alpha} + 10e^{-2/\sqrt[3]{\alpha}} \leq 1`$. For every two sets $`M' \subseteq M`$ and $`A' \subseteq A`$ of mediators and advertisers, respectively, let $`\Val(M', A')`$ denote the expression:

\sum_{b \in \tilde{B} \setminus B(A')} [v(b) - \ell(P, B)] + \sum_{p \in \tilde{P} \setminus P(M')} [\ell(P, B) - c(p)]
    \enspace.

The definition of $`\ell(P, B)`$ guarantees that $`v(b) - \ell(P, B) \geq 0`$ and $`\ell(P, B) - c(p) \geq 0`$ for every $`b \in \tilde{B} \subseteq B_o`$ and $`p \in \tilde{P} \subseteq P_o`$. Thus, $`\Val(M', A') \leq \Val(\varnothing, \varnothing)`$ for every two sets $`M' \subseteq M`$ and $`A' \subseteq A`$. Additionally, it is well-known that the way $`t`$ is chosen guarantees that every entity of $`M \cup A`$ belongs to $`T`$ with probability $`r`$, independently. A proof of this fact can be found, , as Lemma A.1 in . Similarly, every entity of $`M \cup A`$ that does not belong to $`T`$ is added to $`L`$ with probability $`\min\{1, 16r^{-1} \cdot \sqrt[3]{\alpha}\} = 16r^{-1} \cdot \sqrt[3]{\alpha}`$, independently. Hence, every user (slot) of $`\tilde{P}`$ ($`\tilde{B}`$) belongs to $`\tilde{P} \setminus P(M_T \cup M_L)`$ ($`\tilde{B} \setminus B(A_T \cup A_L)`$) with probability

(1 - r)(1 - 16r^{-1} \cdot \sqrt[3]{\alpha})
    \geq
    1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}
    \enspace.

Therefore,

\begin{align*}
    \bE[\Val(M_T \cup M_L, A_T \cup A_L)]
    \geq{} &
    (1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \sum_{b \in \tilde{B}} [v(b) - \ell(P, B)] \\ &+ (1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \sum_{p \in \tilde{P}} [\ell(P, B) - c(b)]\\
    ={} &
    (1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \Val(\varnothing, \varnothing)
    \enspace.
\end{align*}

Using Lemma [lem:joined_event] and the observation that always produces assignments of non-negative gain from trade, we now get:

\begin{align}
 \label{eq:gain_from_trade_hat_S}
    \bE[\GfT(\hat{S})]
    ={} &
    \Pr[\cE] \cdot \bE[\GfT(\hat{S}) \mid \cE] + \Pr[\neg \cE] \cdot \bE[\GfT(\hat{S}) \mid \neg \cE]\nonumber\\
    \geq{} &
    \Pr[\cE] \cdot \bE[\Val(M_T \cup M_L, A_T \cup A_L) \mid \cE]\nonumber\\
    ={} &
    \bE[\Val(M_T \cup M_L, A_T \cup A_L)] - \Pr[\neg \cE] \cdot \bE[\Val(M_T \cup M_L, A_T \cup A_L) \mid \neg \cE]\nonumber\\
    \geq{} &
    (1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \Val(\varnothing, \varnothing) - \Pr[\neg \cE] \cdot \Val(\varnothing, \varnothing)\nonumber\\
    ={} &
    [(1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) - \Pr[\neg \cE]] \cdot \Val(\varnothing, \varnothing)
    \enspace.
\end{align}

Recall that $`\Pr[\neg \cE] \leq 10e^{-2/\sqrt[3]{\alpha}}`$ by Lemma [lem:event_summary]. Additionally, Observation [obs:tilde_elements] and the fact that $`|\tilde{P}| = |\tilde{B}|`$ by definition imply together:

\begin{align*}
    \Val(\varnothing, \varnothing)
    ={} &
    \sum_{b \in \tilde{B}} [v(b) - \ell(P, B)] + \sum_{p \in \tilde{P}} [\ell(P, B) - c(p)]\\
    ={} &
    \sum_{b \in \tilde{B}} v(b) - \sum_{p \in \tilde{P}} c(p)
    \geq
    (1 - 6r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \GfT(S_c(P, A))
    \enspace.
\end{align*}

Plugging the last observations into Inequality [eq:gain_from_trade_hat_S] gives:

\begin{align*}
    \bE[\GfT(\hat{S})]
    \geq{} &
    [(1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) - \Pr[\neg \cE]] \cdot \Val(\varnothing, \varnothing)\\
    \geq{} &
    [(1 - r - 16r^{-1} \cdot \sqrt[3]{\alpha}) - 10e^{-2/\sqrt[3]{\alpha}}] \cdot (1 - 6r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \GfT(S_c(P, A))\\
    \geq{} &
    (1 - r - 22r^{-1} \cdot \sqrt[3]{\alpha} - 10e^{-2/\sqrt[3]{\alpha}}) \cdot \GfT(S_c(P, B))
    \enspace.
\end{align*}

The corollary now follows by recalling that $`S_c(P, B)`$ is the assignment of users from $`P`$ to slots of $`B`$ which maximizes the gain from trade. ◻

Missing Proofs

lem:incentives_for_users For every user $`p`$, assuming the mediator $`m`$ of $`p`$ is truthful, is continuously individually rational for $`p`$, and truthfulness is a dominant strategy for her.

lem:incentives_for_mediators For every mediator $`m`$, assuming the users of $`m`$ are truthful, is continuously individually rational for $`m`$, and truthfulness is a dominant strategy for him.

obs:tilde_elements $`\sum_{b \in \tilde{B}} v(b) - \sum_{p \in \tilde{P}} c(p) \geq (1 - 6r^{-1} \cdot \sqrt[3]{\alpha}) \cdot \GfT(S_c(P, B))`$.

Proof of Lemma <a href="#lem:event_summary" data-reference-type=“ref”

data-reference=“lem:event_summary”>[lem:event_summary]

In this section we prove Lemma [lem:event_summary]. Let us begin the proof with the following technical lemma (this lemma is identical to Lemma 4.9 in . We repeat its proof here for completeness).

Given a subset $`B' \subseteq B_o`$ and a probability $`q \in [0, 1]`$, let $`B'[q]`$ be a random subset of $`B'`$ constructed as follows: for every advertiser $`a \in A`$, independently, with probability $`q`$ the slots of advertiser $`a`$ that belong to $`B'`$ appear also in $`B'[q]`$. Then, for every $`\beta \in (0, 1]`$:

\Pr[||B'[q]| - q \cdot |B'|| \geq \beta \tau]
    \leq
    2e^{-2\beta^2/\alpha}
    \enspace.

Similarly, given a subset $`P' \subseteq P_o`$ and a probability $`q \in [0, 1]`$, let $`P'[q]`$ be a random subset of $`P'`$ constructed as follows: for every mediator $`m \in M`$, independently, with probability $`q`$ the users of mediator $`m`$ that belong to $`P'`$ appear also in $`P'[q]`$. Then, for every $`\beta \in (0, 1]`$:

\Pr[||P'[q]| - q \cdot |P'|| \geq \beta \tau]
    \leq
    2e^{-2\beta^2/\alpha}
    \enspace.

Proof. We prove the first inequality; the second inequality is analogous. First, observe that the lemma is trivial when $`B' = \varnothing`$ since $`B' = \varnothing`$ implies $`||B'[q]| - q \cdot |B'|| = 0 < \beta \tau`$. Thus, we may assume in the rest of the proof $`B' \neq \varnothing`$. For every advertiser $`a \in A`$, let $`X_a`$ be an indicator for the event that slots of $`a`$ appear in $`B'[q]`$. Then:

|B'[q]|
    =
    \sum_{a \in A} X_a \cdot |B' \cap B(a)|
    \enspace.

The definition of $`\alpha`$ implies $`|B(a)| \leq \alpha \tau`$ for every advertiser $`a \in A`$, and thus, $`0 \leq |B' \cap B(a)| \leq \alpha \tau`$. Hence, by Hoeffding’s inequality:

\begin{align*}
    \Pr[||B'[q]| - q \cdot |B'|| \geq \beta \tau]
    ={} &
    \Pr[||B'[q]| - \bE[|B'[q]|]| \geq \beta \tau]
    \leq
    2e^{-\frac{2(\beta \tau)^2}{\sum_{a \in A} |B' \cap B(a)|^2}}\\
    \leq{} &
    2e^{-\frac{2(\beta \tau)^2}{\alpha \tau \cdot \sum_{a \in A} |B' \cap B(a)|}}
    =
    2e^{-\frac{2\beta^2 \tau}{\alpha \cdot |B'|}}
    \leq
    2e^{-\frac{2\beta^2 \tau}{\alpha \cdot |B_o|}}
    =
    2e^{-\frac{2\beta^2}{\alpha}}
    \enspace.
    \qedhere
\end{align*}

 ◻

Let $`\cE'`$ be the event that the following inequalities are all true (at the same time):

(i) $` B_o \cap B(A_T)
(ii) $` P_o \cap P(M_T)

$`\Pr[\cE'] \geq 1 - 8e^{-2/\sqrt[3]{\alpha}}`$.

Proof. As explained in the proof of Corollary [cor:OPM_competitive_ratio], $`T`$ contains every entity of $`M \cup A`$ with probability $`r`$, independently. This means that $`B_o \cap B(A_T)`$, $`\tilde{B} \cap B(A_T)`$, $`P_o \cap P(M_T)`$ and $`\tilde{P} \cap P(M_T)`$ have the same distributions as $`B_o[r]`$, $`\tilde{B}[r]`$, $`P_o[r]`$ and $`\tilde{P}[r]`$, respectively. Moreover, by definition, $`\tilde{B} \subseteq B_o`$ and $`\tilde{P} \subseteq P_o`$. Hence, by Lemma [lem:length_concentration], each one of the four inequalities defining $`\cE'`$ holds with probability at least $`1 - 2e^{-2/\sqrt[3]{\alpha}}`$. The observation now follows by the union bound. ◻

Next, we need the following useful observation (this observation is analogous to Observation 4.11 in , and both observations share identical proofs. We repeat the proof here for completeness).

It always holds that:

\begin{align*}
    \min\{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}
    \leq\inConference{{} &}
    |S_c(P(M_T), B(A_T))|\inConference{\\}
    \leq\inConference{{} &}
    \max \{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}
    \enspace.
\end{align*}

Proof. Let $`p_\tau`$ and $`b_\tau`$ be the user and slot at location $`\tau`$ of $`S_c(P, B)`$, respectively. The definition of a canonical assignment guarantees that we have $`c(p_{\tau}) < v(b_{\tau})`$. Additionally, the slots of $`B_o`$ all appear in locations $`1`$ to $`\tau`$ of $`S_c(P, B)`$, and thus, they all have values at least as large as $`v(b_\tau)`$. Similarly, the users of $`P_o`$ all have costs at most as large as $`c(p_\tau)`$. Combining these observations, we get: $`c(p) \leq c(p_\tau) < v(b_\tau) \leq v(b)`$ for every $`p \in P_o`$ and $`b \in B_o`$.

The slots at locations $`1`$ to $`|B_o \cap B(A_T)|`$ of $`S_c(P(M_T), B(A_T))`$ all belong to $`B_o`$ since $`B_o`$ contains the $`\tau`$ slots with the largest values. Similarly, the users at locations $`1`$ to $`|P_o \cap P(M_T)|`$ belong to $`P_o`$. Combining both observations, we get that for every location $`1 \leq i \leq \min\{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}`$, the user $`p'_i`$ at location $`i`$ of $`S_c(P(M_T), B(A_T))`$ and the slot $`b'_i`$ at this location belong to $`P_o`$ and $`B_o`$, respectively, and thus, $`c(p'_i) < v(b'_i)`$. Hence, by the definition of a canonical assignment, the pair $`(p'_i, b'_i)`$ belongs to $`S_c(P(M_T), B(A_T))`$ for every $`1 \leq i \leq \min\{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}`$; which completes the proof of the first inequality we need to prove.

Assume towards a contradiction that the second inequality we need to prove is wrong. In other words, we assume $`|S_c(P(M_T), B(A_T))| > \max \{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}`$. Let $`j = \max \{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\} + 1`$, and let $`p'_j`$ and $`b'_j`$ be the user and slot at location $`j`$ of $`S_c(P(M_T), B(A_T))`$, respectively. Our assumption implies that $`(p'_j , b'_j)`$ belongs to $`S_c(P(M_T), B(A_T))`$, and thus, $`c(p'_j) < v(b'_j)`$. On the other hand, only the users at locations $`1`$ to $`|P_o \cap P(M_T)|`$ of $`S_c(P(M_T), B(A_T))`$ belong to $`P_o`$, hence, $`p'_j`$ does not belong to $`P_o`$. The user with the lowest cost that does not belong to $`P_o`$ is the user $`p_{\tau + 1}`$ at location $`\tau + 1`$ of $`S_c(P, B)`$. Thus, we get: $`c(p'_j) \geq c(p_{\tau + 1})`$. Similarly, we can also get $`v(b'_j) \leq v(b_{\tau + 1})`$, where $`b_{\tau + 1}`$ is the slot at location $`\tau + 1`$ of $`S_c(P, B)`$. Combining the above inequalities gives:

c(p_{\tau + 1})
    \leq
    c(p'_j)
    <
    v(p'_j)
    \leq
    v(b_{\tau + 1})
    \enspace,

which contradicts the fact that $`p_{\tau + 1}`$ is not assigned to $`b_{\tau + 1}`$ by the canonical assignment $`S_c(P, B)`$. ◻

The next few claims use the last observation to prove a few properties that hold given $`\cE'`$.

The event $`\cE'`$ implies: $`\hat{P} \subseteq P_o`$ and $`\hat{B} \subseteq B_o`$.

Proof. We prove the first inclusion. The other inclusion is analogous. Observation [obs:length_characterization] and the definition of $`\cE'`$ imply:

\begin{align*}
    |S_c(P(M_T), B(A_T))|
    \leq{} &
    \max \{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}\\
    \leq{} &
    \max\{r \cdot |P_o| + \sqrt[3]{\alpha} \cdot \tau, r \cdot |B_o|  + \sqrt[3]{\alpha} \cdot \tau\}
    =
    (r + \sqrt[3]{\alpha})\tau
    \enspace.
\end{align*}

Using the definition of $`\cE'`$ again gives:

\begin{align*}
    |P_o \cap P(M_T)|
    \geq{} &
    r \cdot |P_o| - \sqrt[3]{\alpha} \cdot \tau
    =
    (r - \sqrt[3]{\alpha})\tau\\
    \geq{} &
    (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot (r + \sqrt[3]{\alpha})\tau
    \geq
    (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))|
    \enspace,
\end{align*}

which implies, since $`|P_o \cap P(M_T)|`$ is integral,

\begin{equation}
 \label{eq:threshold_in_original}
    |P_o \cap P(M_T)|
    \geq
    \lceil (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| \rceil
    \enspace.
\end{equation}

If $`\hat{p}`$ is a dummy user then $`\hat{P}`$ is empty, which makes the claim $`\hat{P} \subseteq P_o`$ trivial. Thus, we may assume that $`\hat{p}`$ is the user at location $`\lceil (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| \rceil`$ of the canonical assignment $`S_c(P(M_T), B(A_T))`$. Hence, Inequality [eq:threshold_in_original] and the observation that the users of $`P_o \cap P(M_T)`$ are the users with the lowest costs in $`P(M_T)`$ imply together that $`\hat{p}`$ belongs to the set $`P_o \cap P(M_T) \subseteq P_o`$. On the other hand, $`P_o`$ contains the $`\tau`$ users with the lowest costs. Hence, every user with a cost lower than $`c(\hat{p})`$ must be in $`P_o`$ since $`\hat{p}`$ is in $`P_o`$. The lemma now follows by observing that the definition of $`\hat{P}`$ implies $`c(p) < c(\hat{p})`$ for every user $`p \in \hat{P}`$. ◻

There exists a value $`\ell(P, B)`$ independent of the random coins of such that:

$`c(p) \leq \ell(P, B) \leq v(b)`$ for every user $`p \in P_o`$ and slot $`b \in B_o`$

Whenever the event $`\cE'`$ occurs, $`c(p) \leq \ell(P, B) \leq v(b)`$ for every user $`p \in \hat{P}`$ and slot $`b \in \hat{B}`$.

Proof. Let $`\ell(P, B)`$ be the value of the slot at location $`\tau`$ of the canonical assignment $`S_c(P, B)`$. Clearly, $`\ell(P, B)`$ is independent of the random coins of , as required. Additionally, for every slot $`b \in B_o`$ it holds that $`v(b) \geq \ell(P, B)`$ since $`b`$ must be located at some location of $`S_c(P, B)`$ between $`1`$ and $`\tau`$. On the other hand, let $`p_\tau`$ be the user at location $`\tau`$ of $`S_c(P, B)`$. Since the size of $`S_c(P, B)`$ is $`\tau`$, $`p_\tau`$ must be assigned to the slot at location $`\tau`$ of $`S_c(P, B)`$, which implies $`c(p_{\tau}) \leq \ell(P, B)`$.1 Moreover, for every user $`p \in P_o`$ it holds that $`c(p) \leq c(p_\tau) \leq \ell(P, B)`$ since $`p`$ must be located at some location of $`S_c(P, B)`$ between $`1`$ and $`\tau`$.

The corollary now follows since Lemma [lem:upper_inclusion] shows that the event $`\cE'`$ implies that every user $`p \in \hat{P}`$ belongs also to $`P_o`$, and every slot $`b \in \hat{B}`$ belongs also to $`B_o`$. ◻

The event $`\cE'`$ implies: $`\tilde{P} \setminus P(M_T) \subseteq \hat{P}`$ and $`\tilde{B} \setminus B(A_T) \subseteq \hat{B}`$.

Proof. We prove the first inclusion. The other inclusion is analogous. The claim about $`\tilde{P} \cap P(M \setminus M_T)`$ is trivial when $`\tilde{P}`$ is empty. Thus, we can assume throughout the proof that $`\tilde{P}`$ is non-empty. Observation [obs:length_characterization] and the definition of $`\cE'`$ imply:

\begin{align*}
    |S_c(P(M_T), B(A_T))|
    \geq{} &
    \min \{|P_o \cap P(M_T)|, |B_o \cap B(A_T)|\}\\
    \geq{} &
    \min\{r \cdot |P_o| - \sqrt[3]{\alpha} \cdot \tau, r \cdot |B_o| - \sqrt[3]{\alpha} \cdot \tau\}
    =
    (r - \sqrt[3]{\alpha})\tau
    \enspace.
\end{align*}

Recall that $`\alpha \geq \tau^{-1}`$, and thus, $`\sqrt[3]{\alpha} \cdot \tau \geq 1`$. Using this inequality and the definitions of $`\cE'`$ and $`\tilde{P}`$ now gives:

\begin{align*}
    |\tilde{P} \cap P(M_T)|
    \leq{} &
    r \cdot |\tilde{P}| + \sqrt[3]{\alpha} \cdot \tau
    =
    r \cdot \lceil (1 - 6r^{-1} \cdot \sqrt[3]{\alpha})\tau \rceil + \sqrt[3]{\alpha} \cdot \tau\inConference{\\}
    \leq\inConference{{} &}
    (r - 4\sqrt[3]{\alpha})\tau - 1 + \sqrt[3]{\alpha} \cdot \tau\inArXiv{\\}
    =\inArXiv{{} &}
    (r - 3\sqrt[3]{\alpha})\tau - 1\inConference{\\}
    \leq\inConference{{} &}
    (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot (r - \sqrt[3]{\alpha})\tau - 1\\
    \leq{} &
    (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| - 1
    \enspace,
\end{align*}

which implies, since $`|\tilde{P} \cap P(M_T)|`$ is integral,

\begin{align}
 \label{eq:threshold_above_tilde}
    |\tilde{P} \cap P(M_T)|
    \leq{} &
    \lceil (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| \rceil - 1\\
    <{} &
    \lceil (1 - 2r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |S_c(P(M_T), B(A_T))| \rceil \nonumber
    \enspace.
\end{align}

Inequality [eq:threshold_above_tilde] and the observation that the users of $`\tilde{P} \cap P(M_T)`$ are the users with the lowest costs in $`P(M_T)`$ imply together that $`\hat{p}`$ is a user of $`P(M_T)`$ which does not belong to $`\tilde{P} \cap P(M_T)`$, and therefore, does not belong to $`\tilde{P}`$ either. On the other hand, $`\tilde{P}`$ contains the $`\lceil (1 - 6r^{-1} \cdot \sqrt[3]{\alpha})\tau \rceil`$ users with the lowest costs. Hence, every user $`p \in \tilde{P}`$ has a cost smaller than $`c(\hat{p})`$ since $`\hat{p}`$ does not belong to $`\tilde{P}`$. The lemma now follows by observing that the definition of $`\hat{P}`$ implies that $`p \in \hat{P}`$ for every user $`p \in P(M \setminus M_T)`$ obeying $`c(p) < c(\hat{p})`$. ◻

The event $`\cE'`$ implies: $`-7r^{-1} \cdot \sqrt[3]{\alpha} \cdot \tau \leq |\hat{P}| - (1 - r)\tau \leq \sqrt[3]{\alpha} \cdot \tau`$ and $`-7r^{-1} \cdot \sqrt[3]{\alpha} \cdot \tau \leq |\hat{B}| - (1 - r)\tau \leq \sqrt[3]{\alpha} \cdot \tau`$.

Proof. We prove here only the bounds on the size of $`\hat{P}`$. The bounds on the size of $`\hat{B}`$ are analogous. By Lemma [lem:upper_inclusion], $`\hat{P} \subseteq P_o`$. On the other hand, by definition, $`\hat{P} \subseteq P(M \setminus M_T)`$. Thus, we get: $`\hat{P} \subseteq P_o \setminus P(M_T)`$. Combining this inclusion with the definition of $`\cE'`$ gives:

\begin{align*}
    |\hat{P}|
    \leq{} &
    |P_o \setminus P(M_T)|
    =
    |P_o| - |P_o \cap P(M_T)|\\
    \leq{} &
    |P_o| - [r \cdot |P_o| - \sqrt[3]{\alpha} \cdot \tau]
    =
    (1 - r) \cdot |P_o| + \sqrt[3]{\alpha} \cdot \tau
    =
    (1 - r)\tau + \sqrt[3]{\alpha} \cdot \tau
    \enspace.
\end{align*}

On the other hand, by Lemma [lem:lower_inclusion] and the definition of $`\cE'`$,

\begin{align*}
    |\hat{P}|
    \geq{} &
    |\tilde{P} \setminus P(M_T)|
    =
    |\tilde{P}| - |\tilde{P} \cap P(M_T)|
    \geq
    |\tilde{P}| - [r \cdot |\tilde{P}| + \sqrt[3]{\alpha} \cdot \tau]\\
    ={} &
    (1 - r) \cdot |\tilde{P}| - \sqrt[3]{\alpha} \cdot \tau
    \geq
    (1 - r) \cdot \lceil (1 - 6r^{-1} \cdot \sqrt[3]{\alpha})\tau \rceil - \sqrt[3]{\alpha} \cdot \tau\inConference{\\}
    \geq\inConference{{} &}
    (1 - r)\tau - 7r^{-1} \cdot \sqrt[3]{\alpha} \cdot \tau
    \enspace.
    \qedhere
\end{align*}

 ◻

We can now define the event $`\cE`$ referred to by Lemma [lem:event_summary]. The event $`\cE`$ is the event that $`\cE'`$ happens and in addition the following two inequalities also hold:

(i) $` \hat{B} \setminus B(A_L) \leq

We are now ready to prove Lemma [lem:event_summary]. For ease of the reading, we first repeat the lemma itself.

lem:event_summary There exists an event $`\cE`$ of probability at least $`1 - 10e^{-2/\sqrt[3]{\alpha}}`$ such that $`\cE`$ implies the following:

(i)  \ B(AT) ⊆  (iii) | \ P(ML)| ≤ ||
(ii)  \ P(MT) ⊆  (iv) | \ B(AL)| ≤ ||
(v) c(p) ≤ (P, B) ≤ v(b) for every user p ∈  and slot b ∈ , where (P, B) is a value which is independent of the random coins of and obeys c(p) ≤ (P, B) ≤ v(b) for every p ∈ Po and b ∈ Bo.

Proof. By definition, the event $`\cE`$ implies the inequalities: $`|\hat{B} \setminus B(A_L)| \leq |\hat{P}|`$ and $`|\hat{P} \setminus P(M_L)| \leq |\hat{B}|`$. Additionally, $`\cE`$ implies the event $`\cE'`$, which, by Corollary [cor:middle_value] and Lemma [lem:lower_inclusion], implies the other things that should follow from $`\cE`$ by the lemma. Hence, the only thing left to prove is that the probability of $`\cE`$ is at least $`1 - 10e^{-2/\sqrt[3]{\alpha}}`$.

If $`16r^{-1} \cdot \sqrt[3]{\alpha} \geq 1`$, then $`L`$ contains all the entities arriving after the observation phase, which implies $`A_L = A \setminus A_T`$ and $`M_L = M \setminus M_T`$; and thus, the two inequalities $`|\hat{B} \setminus B(A_L)| \leq |\hat{P}|`$ and $`|\hat{P} \setminus P(M_L)| \leq |\hat{B}|`$ are trivial in this case. Hence, the events $`\cE`$ and $`\cE'`$ are equivalent when $`16r^{-1} \cdot \sqrt[3]{\alpha} \geq 1`$, and therefore, the probability of $`\cE`$ is at least $`1 - 8e^{-2/\sqrt[3]{\alpha}}`$ by Observation [obs:E_prime_probability]. Thus, it is safe to assume in the rest of the proof that $`16r^{-1} \cdot \sqrt[3]{\alpha} < 1`$.

Our plan is to prove the inequality $`\Pr[\cE \mid \cE'] \geq 1 - 2e^{-2/\sqrt[3]{\alpha}}`$. Notice that this inequality indeed implies the lemma since it implies:

\Pr[\cE]
    =
    \Pr[\cE'] \cdot \Pr[\cE \mid \cE']
    \geq
    (1 - 8e^{-2/\sqrt[3]{\alpha}}) \cdot (1 - 2e^{-2/\sqrt[3]{\alpha}})
    \geq
    1 - 10e^{-2/\sqrt[3]{\alpha}}
    \enspace.

The event $`\cE'`$ is fully determined by the sets $`M_T`$ and $`A_T`$. Thus, it is enough to show that for every fixed choice of these sets for which the event $`\cE'`$ holds, the event $`\cE`$ holds with probability at least $`1 - 2e^{-2/\sqrt[3]{\alpha}}`$. Notice that the sets $`\hat{P}`$ and $`\hat{B}`$ become deterministic once we fix the choice of $`M_T`$ and $`A_T`$. Hence, either $`|\hat{B}| \leq |\hat{P}|`$, which implies that the inequality $`|\hat{B} \setminus B(A_L)| \leq |\hat{P}|`$ holds regardless of the choice of $`A_L`$, or $`|\hat{P}| \leq |\hat{B}|`$, which implies that the inequality $`|\hat{P} \setminus P(M_L)| \leq |\hat{B}|`$ holds regardless of the choice of $`M_L`$. In both cases, all we need to show is that the other inequality holds with probability at least $`1 - 2e^{-2/\sqrt[3]{\alpha}}`$ over the random choice of $`A_L`$ and $`M_L`$.

Let us assume, without loss of generality, that $`|\hat{B}| \leq |\hat{P}|`$. By the above discussion, all we need to prove is that $`\Pr[|\hat{P} \setminus P(M_L)| \leq |\hat{B}|] \geq 1 - 2e^{-2/\sqrt[3]{\alpha}}`$, where the probability is over the random choice of $`M_L`$. By Corollary [cor:size_bounds]:

\begin{align*}
    \Pr[|\hat{P} \setminus P(M_L)| > |\hat{B}|]\inConference{&{}}
    \leq\inArXiv{{} &}
    \Pr[|\hat{P} \setminus P(M_L)| > (1 - r)\tau - 7r^{-1} \cdot \sqrt[3]{\alpha} \cdot \tau]\\
    \leq{} &
    \Pr[|\hat{P} \setminus P(M_L)| > (1 - 16r^{-1} \cdot \sqrt[3]{\alpha})((1 - r)\tau + \sqrt[3]{\alpha} \cdot \tau) + \sqrt[3]{\alpha} \cdot \tau]\\
    \leq{} &
    \Pr[|\hat{P} \setminus P(M_L)| > (1 - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |\hat{P}| + \sqrt[3]{\alpha} \cdot \tau]
    \enspace.
\end{align*}

As explained in the proof of Corollary [cor:OPM_competitive_ratio], $`M_L`$ contains every mediator of $`M \setminus M_T`$ with probability $`16r^{-1} \cdot \sqrt[3]{\alpha}`$, independently. Hence, $`\hat{P} \setminus P(M_L)`$ has the same distribution as $`\hat{P}[1 - 16r^{-1} \cdot \sqrt[3]{\alpha}]`$. Therefore, by Lemma [lem:length_concentration]:

\begin{align*}
    \Pr[|\hat{P} \setminus P(M_L)| > |\hat{B}|]
    \leq{} &
    \Pr[|\hat{P} \setminus P(M_L)| > (1 - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |\hat{P}| + \sqrt[3]{\alpha} \cdot \tau]\\
    \leq{} &
    \Pr[||\hat{P}(1 - 16r^{-1} \cdot \sqrt[3]{\alpha})| - (1 - 16r^{-1} \cdot \sqrt[3]{\alpha}) \cdot |\hat{P}|| > \sqrt[3]{\alpha} \cdot \tau]\\
    \leq{} &
    2e^{-2/\sqrt[3]{\alpha}}
    \enspace.
    \qedhere
\end{align*}

 ◻

Model and Definitions

Let us now present the exact details of the model we consider. The model consists of a set $`P`$ of users, a set $`M`$ of mediators, and a set $`A`$ of advertisers. Each user $`p \in P`$ has a non-negative cost $`c(p)`$ which she suffers if she is assigned to an advertiser; thus, the utility of $`p`$ is $`0`$ if she is not assigned and $`t - c(p)`$ if she is assigned and paid $`t`$. The users are partitioned among the mediators, and we denote by $`P(m) \subseteq P`$ the set of users associated with mediator $`m \in M`$ (, the sets $`\{P(m) \mid m \in M\}`$ form a disjoint partition of $`P`$). The utility of a mediator $`m \in M`$ is the amount he is paid minus the total cost his users suffer; hence, if $`x(p) \in \{0, 1\}`$ is an indicator for the event that user $`p \in P(m)`$ is assigned and $`t`$ is the payment received by $`m`$ (part of which might have been forwarded by the mediator to his users), then the utility of $`m`$ is $`t - \sum_{p \in P(m)} x(p) \cdot c(p)`$. Finally, each advertiser $`a \in A`$ has a positive capacity $`u(a)`$, and she gains a non-negative value $`v(a)`$ from every one of the first $`u(a)`$ users assigned to her; thus, if advertiser $`a`$ is assigned $`n \leq u(a)`$ users and has to pay $`t`$ then her utility is $`n \cdot v(a) - t`$.

As explained in Section 6, we assume the entities (, the mediators and advertisers) arrive at a uniformly random order. A mechanism for this model knows the total number of entities2, and views the entities as they arrive; however, it has no prior knowledge about the parameters of the entities or about the users. To compensate for this lack of knowledge, each arriving entity reports information to the mechanism. Each advertiser reports her capacity and value. The reports of the mediators are formed in a slightly more involved way. Each user reports her cost to her mediator, and based on these reports each mediator reports the number of his users and their costs to the mechanism. The users, mediators and advertisers are all strategic, and thus, free to produce incorrect reports. In other words, an advertiser may report incorrect capacity and value, a user may report an incorrect cost and a mediator may report any number of users and associate with each one of them an arbitrary cost.

Every time that a new entity arrives, the mechanism has an opportunity to assign users to advertisers. More specifically, when a mediator arrives the mechanism is allowed to assign users of the newly arriving mediator to advertisers that have already arrived. Similarly, when an advertiser arrives the mechanism is allowed to assign users of mediators that have already arrived to the newly arriving advertiser. The objective of the mechanism is to end up with an assignment of users to advertisers maximizing the gain from trade. In order to incentivize the mediators and advertisers to report truthfully, the mechanism may charge the advertisers and pay the mediators. Additionally, the mechanism is also allowed to recommend for each mediator how much of the payment he received to forward to each one of his user. It is important to observe that the utility function of the mediators is not affected by the forwarding of payments to the users, and thus, it is reasonable to believe that mediators follow the forwarding recommendations.

We say that a user is truthful if she reports her true cost. Similarly, an advertiser is truthful if she reports her true capacity and value. Finally, a mediator is considered truthful if he reports to the mechanism his true number of users and the costs of the users as reported to him; and, in addition, he also pays the users according to the recommendation of the mechanism (in other words, he lets them know about their true balance).

Similarly to the practice of , we associate a set $`B(a)`$ of $`u(a)`$ slots with each advertiser $`a \in A`$. This allows us to think of the users as assigned to slots instead of directly to advertisers. Formally, let $`B`$ be the set of all slots (, $`B = \bigcup_{a \in A} B(a)`$), then an assignment is a set $`S \subseteq B \times P`$ in which no user or slot appears in more than one ordered pair. We say that an assignment $`S`$ assigns a user $`p`$ to slot $`b`$ if $`(p, b) \in S`$. Similarly, we say that an assignment $`S`$ assigns user $`p`$ to advertiser $`a`$ if there exists a slot $`b \in B(a)`$ such that $`(p, b) \in S`$. It is also useful to define values for the slots. For every slot $`b`$ of advertiser $`a`$, we define the value $`v(b)`$ of $`b`$ as equal to the value $`v(a)`$ of $`a`$. Using this notation, the gain from trade of assignment $`S`$ can be stated as:

\GfT(S)
    =
    \sum_{(p, b) \in S} [v(b) - c(p)]
    \enspace.

Finally, we would like to define two additional shorthands that we use occasionally. Given a set $`A' \subseteq A`$ of advertisers, we denote by $`B(A') = \bigcup_{a \in A'} B(a)`$ the set of slots belonging to advertisers of $`A'`$. Similarly, given a set $`M' \subseteq M`$ of mediators, $`P(M') = \bigcup_{m \in M'} P(m)`$ is the set of users associated with mediators of $`M'`$.

Comparison of Costs and Values

The presentation of our mechanism is simpler when the values of slots and the costs of users are all unique. Clearly, this is extremely unrealistic since all the slots of a given advertiser have the exact same value in our model. Thus, we simulate uniqueness using a tie-breaking rule. The rule we assume works as follows:

The mechanism chooses an arbitrary order on the mediators and advertisers. It is important that this order is chosen independently of the reports received by the mechanism and the arrival order of the entities (if there is no natural order that can be used, the mechanism can simply choose a uniformly random order). The mechanism then uses this order to break ties when comparing users to slots and when comparing between users (slots) associated with different mediators (advertisers). For example, when comparing the cost of user $`p`$ with the value of a slot $`b`$, the mechanism breaks ties in favor of $`p`$ if and only if the mediator of $`p`$ appears earlier than the advertiser of $`b`$ in the chosen order.

We assume that the report of every mediator induces some order on the set of users of this mediators. The mechanism uses this order to break ties between the costs of users belonging to the same mediator.

Finally, since the slots of a given advertiser are all identical and non-strategic (recall that slots were introduced into the model just for the purpose of simplifying the presentation), any method can be used for tie-breaking between the slots of a given advertiser.

In the rest of this paper, whenever costs/values are compared the comparison is assumed to use the above tie breaking rule. Note that this assumption implies that two values (costs) are equal if and only if they belong to the same slot (user); moreover, the value of a slot is never equal to the cost of a user.

Canonical Assignment

Given a set $`B' \subseteq B`$ of users and a set $`P' \subseteq P`$ of slots, the canonical assignment $`S_c(P', B')`$ is the assignment constructed by the following process. First, we order the slots of $`B'`$ in a decreasing value order $`b_1, b_2, \dotsc, b_{|B'|}`$ and the users of $`P'`$ in an increasing cost order $`p_1, p_2, \dotsc, p_{|P'|}`$. Then, for every $`1 \leq i \leq \min\{|B'|, |P'|\}`$ the canonical assignment $`S_c(B', P')`$ assigns user $`p_i`$ to slot $`b_i`$ if and only if $`v(b_i) > c(p_i)`$.

The canonical assignment is an important tool we use often in the next section. In some places we refer to the user or slot at location $`i`$ of a canonical assignment $`S_c(P', B')`$, by which we mean user $`p_i`$ or slot $`b_i`$, respectively. Additionally, the size $`|S_c(P, B)|`$ of the canonical assignment $`S_c(P, B)`$ is used very often in our proofs, and thus, it is useful to define the shorthand $`\tau = |S_c(P, B)|`$.

The following lemma, which was proved by , shows that the canonical assignment is always an optimal assignment.

The canonical assignment $`S_c(P', B')`$ maximizes $`\GfT(S_c(P', B'))`$ among all the possible assignments of users of $`P'`$ to slots of $`B'`$.

Introduction

Mechanisms for multi-sided markets are given an increasingly growing focus in the research community. The study of such mechanisms is motivated by the continuous growth in the number of transactions and exchanges, and the need for competitiveness, which promotes adoption of new exchange mechanisms. Among the numerous examples of multi-sided markets on the web and in electronic commerce one can find: online advertising exchanges, stock exchanges, business-to-business commerce and bandwidth allocation. Many of these examples represent dynamic and uncertain environments, which led to an interest in online exchange mechanisms.

A natural expectation from an online exchange mechanism is to (approximately) maximize the gain from trade, while guaranteeing desirable economic properties such as incentivizing truthfulness, voluntary participation and avoiding budget deficit. Unfortunately, as far as we know, no previous work has managed to achieve these goals simultaneously. Wurman et al.  presented a mechanism incentivizing truthful reporting from either the buyers or the sellers, but not simultaneously from both. A different mechanism given by Blum et al.  maximizes the social welfare of buyers and non-selling sellers (as opposed to maximizing the gain from trade). Finally, Bredin et al.  present a truthful online double-sided auction that is constructed from a truthful offline double-sided auction rule. However, the competitiveness of ’s mechanism with respect to the optimal trade was only studied empirically.

The failure of the above works to maximize the gain from trade while maintaining truthfulness, individual rationality (voluntary participation) and budget balance (avoiding budget deficit) can be partially attributed to an impossibility result of . This impossibility result states that, even in an offline setting involving a single buyer and a single seller, maximizing the gain from trade while maintaining truthfulness and individual rationality perforce runs a deficit (, is not budget balanced). An additional reason for the above failure is that the matching problem faced by the market maker (exchange mechanism) in multi-sided online markets combines elements of online algorithms and sequential decision making with considerations from mechanism design. More specifically, unlike in a traditional online algorithm, a mechanism for such a setting must provide incentives for players to report truthful information to the mechanism. On the other hand, unlike in traditional mechanism design, this is a dynamic setting with players that arrive over time, and the mechanism must deal with uncertainty and make irrevocable decisions before the arrival of all the players.

In this work we present the first3 online mechanism for a multi-sided market setting which guarantees the economic properties of truthfulness, individual rationality, and budget balance while (approximately) maximizing the gain from trade. Specifically, we consider a market setting presented by . This setting is motivated by online advertising in its foreseeable future form. Online advertising currently supports some of the most important Internet services, including: search, social media and user generated content sites. For online advertising to be effective, companies collect vast amounts of information about users, which increasingly creates privacy concerns. As these concerns are especially pronounced in the European society, EU regulators have actively been looking for ways to improve users’ privacy. One such way, which was suggested by the EU regulators, is development of tools that enable the end user to choose which parts of their private information online advertising platforms are allowed to collect.

The market setting suggested by  based on this motivation includes advertisers as buyers, users as sellers (each willing to sell her own information portfolio through a broker) and information brokers as mediators representing the users. The objective of a mechanism for this setting is to end up with a match between users and advertisers maximizing the gain from trade. Towards that goal, the mechanism has to collect information from the mediators and advertisers; and thus, needs to incentivize the mediators and advertisers to report truthfully, which it can do by charging the advertisers and paying the mediators. Additionally, the mechanism can also recommend for the mediators to forward some of the payment they receive to the users, which allows the mechanism to affect the incentives of the users as well.

In order to convert the above offline setting suggested by  into an online setting, we assume the mediators and advertisers arrive at a uniformly random order, and refer to the arriving advertisers and mediators as arriving entities. Every time that a new entity arrives, the mechanism has an opportunity to assign users to advertisers. More specifically, when a mediator arrives the mechanism is allowed to assign users of the newly arriving mediator to advertisers that have already arrived. Similarly, when an advertiser arrives the mechanism is allowed to assign users of mediators that have already arrived to the newly arriving advertiser. Notice that this means that the mechanism is not allowed to cancel assignments that have already been made, or assign a user of a mediators that has already arrived to an advertiser that has already arrived. These restrictions, together with the random arrival order, represent the online nature of the setting. We note that our choice to model an online market using a random arrival order is a well established practice (see  for a few examples). Intuitively, this modeling choice is based on the assumption that real arrival orders are arbitrary rather than adversarial.

The online nature of our setting raises the question of what it means for a mechanism to be individually rational. As usual, individual rationality should imply that a player never losses by participating. However, in an online setting it is natural to require also that a player never losses by not leaving prematurely. We introduce a new concept called “continuous individual rationality” which captures the above intuitive requirement. Satisfying the requirements of continuous individual rationality, together with the other economic properties our mechanism guarantees, requires the mechanism to use a novel pricing scheme where users may be paid ongoing increments during the mechanism’s execution. The maximum total payment that a user may end up with is pre-known (when the user arrives), however, the actual increments are not pre-know and depend on the market’s online changing demands and supplies. As users rarely ever get paid in reality, this pricing scheme is new to mechanism design and might look odd at first glance. Nevertheless, the principle it is based on can be observed in many common real life scenarios such as executive compensation payments and company acquisition deals. For example, the eBay acquisition of Skype in 2005 involved both an upfront payment and an additional payment whose amount depended on the future performance of the bought company (https://investors.ebayinc.com/releasedetail.cfm?releaseid=176402).

It is interesting to note that the setting of  involves multi-minded players (the advertisers and mediators), , players with a multi-dimensional strategic space. The mechanisms suggested by  for this setting are the only known mechanisms, to date, which circumvent the impossibility result of in a model involving multi-minded players. Our result shows that it is possible to handle multi-minded players also in an online setting.

Our Result

As discussed above, we extend the standard concept of individual rationality in a way that seems to match better the intuitive meaning of individual rationality in an online setting. Specifically, we say that a mechanism is continuously individually rational for a player (a user, a mediator or an advertiser) if the player’s utility can only increase over time when the player is truthful4. In other words, the utility of the player is always $`0`$ before the arrival of the first entity. After the arrival of each entity the mechanism updates the assignment and payments, which can affect the player’s utility. If the change in the utility is always non-negative when the player is truthful, then the mechanism is continuously individually rational for the player. Note that the newly presented concept of continuous individual rationality is a stronger concept than individual rationality in its classic form as it requires two things. First, ex post individual rationality, , the players are never worse off when the mechanism terminates, compared to their situation before the mechanism’s execution. Second, it is individually rational for a player to remain throughout the execution of the mechanism, , each individual step during the execution can only increase the utilities of the players.

Like in , we say that a mechanism is user-side incentive compatible if truthfulness is a dominant strategy5 for each user given that her mediator is truthful (regardless of any parameter of the model, such as the number of mediators, and regardless of the other players’ strategies). Similarly, the mechanism is user-side continuously individually rational if it is continuously individually rational for each user given that her mediator is truthful (again regardless of any parameters and regardless of other players’ strategies). A mechanism is mediator-side incentive compatible if truthfulness is a dominant strategy for each mediator whose users are all truthful, and it is mediator-side continuously individually rational if it is continuously individually rational for every such mediator (again regardless of any parameters or other players’ strategies). Finally, a mechanism is advertiser-side incentive compatible if truthfulness is a dominant strategy for every advertiser, and it is advertiser-side continuously individually rational if it is continuously individually rational for every advertiser. We construct a mechanism which is three-sided incentive compatible (, it is simultaneously user-side incentive compatible, mediator-side incentive compatible and advertiser-side incentive compatible) and also three-sided continuously individually rational (, it is simultaneously user-side continuously individually rational, mediator-side continuously individually rational and advertiser-side continuously individually rational).

Our mechanism is termed “Observe and Price Mechanism” (). The following theorem analyzes the economic properties guaranteed by and its competitive ratio. The parameter $`\alpha`$ is an upper bound, known to the mechanism, on the market importance of any single player. Formally, $`\alpha`$ bounds the ratio between the size of the optimal trade and the maximum capacity of an advertiser or the maximum number of users that a mediator can represent. For large markets, such as the market motivating our work, $`\alpha`$ is expected to be much smaller than $`1`$.

is budget balanced6, three-sided continuously individually rational, three-sided incentive compatible and $`(1 - 9.5\sqrt[6]{\alpha} - 10e^{-2/\sqrt[3]{\alpha}})`$-competitive.

Note that our mechanism is randomized, but is guaranteed, by its three-sided incentive compatibility, to be truthful for all possible random coin flips. This stands in contrast to many randomized mechanisms from the literature, which are typically proven to be truthful only in expectation or with high probability.

Our setting is based on the model of . From a motivational point of view this model is closely related to works that involve mediators and online advertising markets, such as . The models studied by these works are motivated by the current networks exchanges for online display ads, where advertisers buy through mediators (the networks) which in turn buy goods (publishing space) from a single seller. Despite their motivation by a network exchange, these models are actually auctions (, one-sided mechanisms). Moreover, they focus on offline mechanisms maximizing revenue, which is very different from the focus of our work. See  for additional works related to their model, and in particular to the study of mediators.

The uniformly random arrival order of the entities in our setting relates it to the vast literature on the secretary problem. The original form of the secretary problem first appeared around the 1960’s, although its exact origin remains unclear . In this form, the problem asks to select online the maximum value element among a set of elements arriving at a uniformly random order. Connections found between various extensions of the secretary problem and mechanism design have motivated an extensive study of these extensions over the last decade (for an excellent survey on these extensions see ). One extension of the secretary problem which is particularly relevant to our work is an extension in which the arriving elements are the right side nodes of a bipartite graph whose left side is known to the algorithm. The algorithm then have to construct a maximum weight matching online. More specifically, for every arriving right side node the algorithm must decide, immediately and irrevocably, whether to assign it to a left side node, and if so to which one. Kesselheim et al.  describe a $`e^{-1}`$-competitive algorithm for this extension of the secretary problem. Our own setting can be viewed as a variant of this extension involving strategic considerations.

Our presented mechanism is composed of two phases: an observation phase in which the algorithm collects information but makes no assignments, and a matching phase. This partition of the mechanism into two phases is similar to the structure found in many algorithms for the secretary problem and its extensions. Moreover, within the literature on algorithmic game theory, an analog for the observation phase can be found in the work of Goldberg et al.  who described random sampling auctions in which an observed (sampled) set is used in order to compute a threshold for the remaining players. It is important to note, however, that, unlike in our work, focuses on one-sided online auctions with unlimited supply.

In addition to the above described extension of the secretary problem, there has been a significant body of works studying online matching problems with an adversarial arrival order. This body of work was originated by the work of Karp et al.  who described an optimal online algorithm for unweighted bipartite online matching. Later works considered more general settings allowing various kinds of weights (see, for example, ). We note that these works do not refer to strategic considerations despite the fact that they are mostly motivated by online advertisement markets.

Last but not least, we would like to mention matching markets, which are a model related to double-sided auctions—though without money. There is extensive literature on online matching markets. The classic matching algorithm is the deferred acceptance algorithm . This algorithm is truthful for one side of the market, and produces a stable match with respect to reported preferences. Moreover, it is known that there does not exist a stable matching mechanism that is truthful for all players . Readers are referred to for more background on the matching markets literature.


  1. In fact, we even have $`c(p_{\tau}) < \ell(P, B)`$ since the tie-breaking rule defined in Section 7.1 guarantees that the value of a slot is never equal to the cost of a user. ↩︎

  2. In some cases the assumption that the mechanism has a prior knowledge about the number of entities might be considered unnatural. The mechanism we present can be modified using standard techniques to work with an alternative assumption stating that each entity arrives at a uniformly random time from some range (for example, $`[0, 1]`$). We refer the reader to  for more details about the relation between these assumptions. ↩︎

  3. To the best of our knowledge. ↩︎

  4. Informally, a player is truthful if he/she reports the information as it is known to him/her. A formal definition of what does it mean for a user, mediator or advertiser to be truthful is given in Section 7↩︎

  5. Here and throughout the paper, a reference to domination of strategies should be understood as a reference to weak domination. We never refer to strong domination. ↩︎

  6. A mechanism is budget balanced if the amount it charges (from the advertisers) is at least as large as the amount it pays (to the mediators). ↩︎