Optimization version of decision problems

It is known that every optimization/search problem has an equivalent decision problem. For example, take the shortest path problem:

optimization/search version: Given an undirected unweighted graph $G = (V, E)$ and two vertices $v,u\in V$, find a shortest path between $v$ and $u$.

decision version: Given an undirected unweighted graph $G = (V, E)$, two vertices $v,u\in V$, and a non-negative integer $k$, is there a path in $G$ between $u$ and $v$ whose length is at most $k$?

In general, "Find $x^*\in X$ s.t. $f(x^*) = \min\{f(x)\mid x\in X\}$!" becomes "Is there $x\in X$ s.t. $f(x) \leq k$?".
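To make the two versions concrete, here is a minimal Python sketch for the shortest path example. It assumes the graph is given as an adjacency-list dictionary; the function names and the small example graph are illustrative assumptions, not part of the question.

```python
from collections import deque


def shortest_path(adj, u, v):
    """Search version: return a shortest u-v path as a list of vertices, or None."""
    parent = {u: None}          # also serves as the set of visited vertices
    queue = deque([u])
    while queue:
        w = queue.popleft()
        if w == v:
            path = []
            while w is not None:  # walk back along the BFS tree
                path.append(w)
                w = parent[w]
            return path[::-1]
        for n in adj[w]:
            if n not in parent:
                parent[n] = w
                queue.append(n)
    return None


def has_path_of_length_at_most(adj, u, v, k):
    """Decision version: is there a u-v path with at most k edges?"""
    path = shortest_path(adj, u, v)
    return path is not None and len(path) - 1 <= k


# Hypothetical example graph: the 4-cycle a-b-c-d-a.
graph = {"a": ["b", "d"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c", "a"]}
```

Here the decision version is answered by calling the search version, which is the easy direction of the correspondence.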

But is the reverse also true, i.e. is there an equivalent optimization problem for every decision problem? If not, what is an example of a decision problem that has no equivalent optimization problem?

— Luke Miles

Is this bit zero?

— Jeff

You should explain "equivalent" in more detail, e.g. do you mean that one can be solved using the other as an oracle/black box in polynomial time (or in logarithmic space)? Do you care about all problems or only problems inside $\sf{NP}$?

— Kaveh

Depending on your point of view, the question is either trivial (take any decision problem that has no "$k$") or not answerable (how would one prove that "no equivalent optimization problem exists"?).

— Raphael

As already said in the comments, it depends on the definitions, as usual. My attempt at answering this needs quite a few definitions, so this will be another example of my inability to give concise answers.

Definition: An optimization problem is a tuple $(X,F,Z,\odot)$ with

$X$ the set of suitably encoded (string) instances or inputs.
$F$ a function that maps each instance $x\in X$ to a set $F(x)$ of feasible solutions of $x$.
$Z$ the objective function that maps each pair $(x, y)$, where $x \in X$ and $y\in F(x)$, to a real number $Z(x, y)$ called the value of $y$.
$\odot$ the optimization direction, either $\min$ or $\max$.

Definition: An optimal solution of an instance $x\in X$ of an optimization problem $P_O$ is a feasible solution $y\in F(x)$ for which $Z(x, y)=\odot\{Z(x, y')\mid y'\in F(x)\}$. The value of an optimal solution is denoted by $Opt(x)$ and called the optimum.

Definition: The evaluation problem, denoted $P_E$, corresponding to the optimization problem $P_O$ is the following: Given an instance $x\in X$, compute $Opt(x)$ if $x$ has an optimal solution, and output "no optimal solution" otherwise.

Note that this just asks for the value of the optimal solution, not for the solution itself with all its details.

Definition: The decision problem, denoted $P_D$, corresponding to the optimization problem $P_O$ is the following: Given a pair $(x, k)$, where $x\in X$ and $k\in\mathbb{Q}$, decide whether $x$ has a feasible solution $y$ such that $Z(x, y)\le k$ if $\odot=\min$ and such that $Z(x, y)\ge k$ if $\odot=\max$.

A first observation is that $P_O\in \mathrm{NPO} \Rightarrow P_D\in \mathrm{NP}$. The proof is not hard and is omitted here.

Now intuitively $P_E$ and $P_D$ corresponding to $P_O$ are not more difficult than $P_O$ itself. To express this feeling formally (thereby defining what equivalent is supposed to mean) we will use reductions.

Recall that a language $L_1$ is polynomial-time reducible to another language $L_2$ if there is a function $f$, computable in polynomial time, such that for all words $x$, $x\in L_1\Leftrightarrow f(x)\in L_2$. This kind of reducibility is known as Karp or many-to-one reducibility, and if $L_1$ is reducible to $L_2$ in this manner, we express this by writing $L_1\le_m L_2$. This is a central concept in the definition of NP-completeness.

Unfortunately, many-to-one reductions go between languages and it is not clear how to employ them in the context of optimization problems. Therefore we need to consider a different kind of reducibility, Turing reducibility. First we need this:

Definition: An oracle for a problem $P$ is a (hypothetical) subroutine that can solve instances of $P$ in constant time.

Definition: A problem $P_1$ is polynomial-time Turing-reducible to a problem $P_2$, written $P_1\le_T P_2$, if instances of $P_1$ can be solved in polynomial time by an algorithm with access to an oracle for $P_2$.

Informally, just as with $\le_m$, the relation $P_1\le_T P_2$ expresses that $P_1$ is no more difficult than $P_2$. It is also easy to see that if $P_2$ can be solved in polynomial time, then so can $P_1$. Again $\le_T$ is a transitive relation. The following fact is obvious:

Let $P_O\in \mathrm{NPO}$, then $P_D\le_T P_E\le_T P_O$.

This holds because, given a complete optimal solution, computing its value and deciding whether it meets the bound $k$ is simple.
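The two reductions behind this fact can be sketched as follows. This is a minimal Python sketch, assuming oracles are passed in as ordinary callables and that feasible sets are finite, so that a missing optimum means there is no feasible solution. All names and the toy problem (minimum element of a list) are illustrative assumptions.

```python
def evaluate_with_opt_oracle(x, opt_oracle, Z):
    """P_E <=_T P_O: one oracle call yields an optimal solution, then apply Z."""
    y = opt_oracle(x)
    return None if y is None else Z(x, y)   # None stands for "no optimal solution"


def decide_with_eval_oracle(x, k, eval_oracle, direction="min"):
    """P_D <=_T P_E: one oracle call yields Opt(x), then compare with the bound k.

    eval_oracle must be the evaluation oracle of the same problem,
    i.e. return the minimum for "min" and the maximum for "max".
    """
    opt = eval_oracle(x)
    if opt is None:
        return False
    return opt <= k if direction == "min" else opt >= k


# Toy minimisation problem (assumption): an instance is a list of integers,
# every element is a feasible solution, and its value is the element itself.
def toy_Z(x, y):
    return y


def toy_opt_oracle(x):
    return min(x) if x else None


def toy_eval_oracle(x):
    return evaluate_with_opt_oracle(x, toy_opt_oracle, toy_Z)
```

Each reduction makes a single oracle call plus polynomial-time post-processing, which is why the fact is considered obvious.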

Definition: If for two problems $P_1$ and $P_2$ both relations $P_1\le_T P_2$ and $P_2\le_T P_1$ hold, we write $P_1\equiv_T P_2$; this is our notion of equivalence.

Now we are ready to prove that $P_D\equiv_T P_E$, given that the corresponding optimization problem satisfies $P_O\in \mathrm{NPO}$ and $Z$ is integer-valued. We have to show that $P_E \le_T P_D$ holds. We can determine $\odot\{Z(x,y)\mid y\in F(x)\}$ with a binary search using an oracle for $P_D$. The definition of $\mathrm{NPO}$ ensures that $|Z(x, y)|\le 2^{q(|x|)}$ for some polynomial $q$, so the number of steps in the binary search is polynomial in $|x|$. $\Box$
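The binary search can be sketched in Python as follows. This is a minimal sketch for the minimisation case only, assuming an integer-valued objective, a decision oracle passed in as the callable `decide`, and a known integer `bound` playing the role of $2^{q(|x|)}$; these names are illustrative assumptions.

```python
def opt_via_decision(x, decide, bound):
    """Compute Opt(x) for a minimisation problem with integer objective.

    decide(x, k) answers: is there a feasible y with Z(x, y) <= k?
    bound is an integer B with |Z(x, y)| <= B for every feasible y.
    Returns None if x has no feasible solution.
    """
    if not decide(x, bound):
        return None
    lo, hi = -bound, bound      # invariant: lo <= Opt(x) <= hi
    while lo < hi:
        mid = (lo + hi) // 2    # floor division, so lo <= mid < hi
        if decide(x, mid):
            hi = mid            # some solution has value <= mid
        else:
            lo = mid + 1        # every solution has value > mid
    return lo


# Toy decision oracle (assumption): instances are lists of integers and
# every element is a feasible solution whose value is the element itself.
def toy_decide(x, k):
    return any(y <= k for y in x)
```

The loop halves the interval $[-B, B]$ on every oracle call, so it makes about $\log_2(2B+1)$ calls, which is linear in $q(|x|)$ when $B = 2^{q(|x|)}$.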

For an optimization problem $P_O$ the relation to $P_E$ is less clear. In many concrete cases one can show directly that $P_D\equiv_T P_E \equiv_T P_O$. To prove that this holds generally within the framework given here we need an additional assumption.

First we need to extend $\le_m$ from pairs of languages to pairs of the corresponding decision problems. Then it is easy to see that $\le_T$ is more general than $\le_m$ .

Let $P$ and $P'$ be decision problems; then $P\le_m P' \Rightarrow P\le_T P'$ . This holds because a many-to-one reduction can be interpreted as making use of an oracle in a very restricted way: The oracle is called once, at the very end, and its result is also returned as the overall result. $\Box$

Now we are ready for the finale:

Let $P_O\in \mathrm{NPO}$ and suppose $Z$ is integer-valued and that $P_D$ is NP-complete, then

$P_D\equiv_T P_E \equiv_T P_O.$

With the previous observations it remains to show $P_O\le_T P_E$. To do this we will exhibit a problem $P_O'\in \mathrm{NPO}$ such that $P_O\le_T P_E'$. Then we have

$P_O\le_T P_E' \le_T P_D'\le_T P_D\le_T P_E.$

The second and fourth $\le_T$ hold because of the equivalence of the decision and evaluation versions proved earlier. The third $\le_T$ follows from the NP-completeness of $P_D$ and the two facts mentioned before, namely $P_O\in \mathrm{NPO} \Rightarrow P_D\in \mathrm{NP}$ (applied to $P_O'$) and $P\le_m P' \Rightarrow P\le_T P'$.

Now the details: Assume that the feasible solutions of $P_O$ are encoded using an alphabet $\Sigma$ equipped with a total order. Let $w_0, w_1, \ldots$ be the words from $\Sigma^*$ listed in order of nondecreasing length and lexicographic order within the blocks of words with common length. (Thus $w_0$ is the empty word.) For all $y\in\Sigma^*$ let $\sigma(y)$ denote the unique integer $i$ such that $y=w_i$ . Both $\sigma$ and $\sigma^{-1}$ can be computed in polynomial time. Let $q$ be a polynomial such that for all $x\in X$ and all $y\in F(x)$ we have $\sigma(y)<2^{q(|x|)}$ .
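One way to compute $\sigma$ and $\sigma^{-1}$ is sketched below in Python. It assumes the ordered alphabet is given as a string of distinct symbols in increasing order; the function names mirror the symbols in the text, but the implementation is only an illustration.

```python
def sigma(word, alphabet):
    """Index of `word` in the length-then-lexicographic enumeration w_0, w_1, ..."""
    m = len(alphabet)
    shorter = sum(m ** i for i in range(len(word)))  # number of strictly shorter words
    rank = 0
    for ch in word:                                  # read the word as a base-m number
        rank = rank * m + alphabet.index(ch)
    return shorter + rank


def sigma_inverse(i, alphabet):
    """Return the word w_i."""
    m = len(alphabet)
    length = 0
    while i >= m ** length:      # skip the complete blocks of shorter words
        i -= m ** length
        length += 1
    digits = []
    for _ in range(length):      # write the remaining rank with exactly `length` digits
        i, d = divmod(i, m)
        digits.append(alphabet[d])
    return "".join(reversed(digits))
```

For the alphabet `"ab"` the enumeration starts with the empty word, then `a`, `b`, `aa`, `ab`, `ba`, `bb`. Both functions perform a number of big-integer operations that is linear in the length of the word.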

Now the problem $P_O'$ is identical to $P_O$ except for a modified objective function $Z'$ . For $x\in X$ and $y\in F(x)$ we take $Z'(x, y)=2^{q(|x|)}\cdot Z(x,y)+\sigma(y)$ . $Z'$ is computable in polynomial time thus $P_O'\in \mathrm{NPO}$ .

To show that $P_O\le_T P_E'$ we observe that $x$ is feasible for $P_O$ if and only if it is feasible for $P_E'$ . We can assume that this is the case, since the opposite case is trivial to handle.

The substitution of $Z'$ for $Z$ is monotonic in the sense that for all $y_1, y_2\in F(x)$, if $Z(x, y_1)<Z(x, y_2)$ then $Z'(x, y_1)<Z'(x, y_2)$. This implies that every optimal solution for $x$ in $P_O'$ is an optimal solution of $x$ in $P_O$. Thus our task reduces to the computation of an optimal solution $y$ of $x$ in $P_O'$.

Querying the oracle for $P_E'$ we can get the value of $Z'(x,y)=2^{q(|x|)}\cdot Z(x,y)+\sigma(y)$ . Forming the remainder of this number modulo $2^{q(|x|)}$ yields $\sigma(y)$ from which $y$ can be computed in polynomial time.
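The whole construction can be tried out on a toy problem. The sketch below is hedged: the toy problem (choose a non-empty subset of items of minimum total weight, encoded as a 0/1 string), the choice $q(n) = n + 1$, and the brute-force stand-in for the $P_E'$ oracle are all assumptions made for illustration.

```python
from itertools import product

ALPHABET = "01"  # ordered alphabet Sigma, with "0" < "1"


def sigma(word):
    m = len(ALPHABET)
    shorter = sum(m ** i for i in range(len(word)))
    rank = 0
    for ch in word:
        rank = rank * m + ALPHABET.index(ch)
    return shorter + rank


def sigma_inverse(i):
    m = len(ALPHABET)
    length = 0
    while i >= m ** length:
        i -= m ** length
        length += 1
    digits = []
    for _ in range(length):
        i, d = divmod(i, m)
        digits.append(ALPHABET[d])
    return "".join(reversed(digits))


def feasible(x):
    """Toy F(x): all 0/1 strings of length len(x) that select at least one item."""
    for bits in product(ALPHABET, repeat=len(x)):
        if "1" in bits:
            yield "".join(bits)


def Z(x, y):
    """Toy objective: total weight of the selected items."""
    return sum(w for w, b in zip(x, y) if b == "1")


def q(n):
    return n + 1  # fewer than 2**(n + 1) binary words have length <= n


def eval_oracle_prime(x):
    """Stand-in for the P_E' oracle: brute-force minimum of Z'(x, y)."""
    scale = 2 ** q(len(x))
    return min((scale * Z(x, y) + sigma(y) for y in feasible(x)), default=None)


def optimal_solution(x):
    """P_O <=_T P_E': one oracle call, then decode sigma(y) from the remainder."""
    opt_prime = eval_oracle_prime(x)
    if opt_prime is None:
        return None
    return sigma_inverse(opt_prime % 2 ** q(len(x)))
```

For the instance `(4, 2, 7)` the oracle value is $16\cdot 2 + 9 = 41$, and $41 \bmod 16 = 9 = \sigma(\texttt{010})$, so the decoded solution selects only the item of weight 2. When several solutions share the optimal value, the term $\sigma(y)$ breaks the tie in favour of the one that comes first in the enumeration.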

— uli

"An oracle for a problem P is a (hypothetical) subroutine that can solve instances of P in constant time." Must an oracle take only constant time?

— Tim

@Tim Of course there are books, I listed a few in the comments of another answer

— uli

@Tim Regarding the oracle: If you have found/conceived a reduction $A\le_T B$ between two problems $A$ and $B$, you have reduced the problem of finding an efficient algorithm for $A$ to finding an efficient algorithm for $B$. Or in other words the reduction tells you that in order to solve $A$ you can use $B$. It is like using a subroutine for $B$ in an algorithm for $A$. However the problems $A$ and $B$ are often problems where we don’t know efficient solutions. And in case of Turing-reducibility we even use it in cases where the problems involved aren’t decidable at all.

— uli

@Tim Thus $B$ is an unknown subroutine. It has become a custom in complexity theory to call the hypothetical algorithm for $A$ derived from the reduction an algorithm with oracle $B$. Calling the unknown subroutine for $B$ an oracle just expresses that we can’t hope to find an efficient algorithm for $B$ just as we can’t hope to obtain an oracle for $B$. This choice is somewhat unfortunate, as it connotes a magical ability. The cost for the oracle should be $|x|$ as a subroutine has at least to read the input $x$.

— uli

An excellent answer all around; the only thing I would add (coming at it now via another question) is that the 'optimization direction' is a needless bit of complexity and for concreteness we can always presume that the objective function $Z$ is to be maximized; if the intention is to minimize, then we can just define a new objective function $Z'=-Z$ and rewrite all the minimization of $Z$ as maximization of $Z'$.

— Steven Stadnicki

As the comments say, the answer depends on the exact definitions. Let me interpret the question in a very basic (even naïve) way.

Let $S$ be some relation, that is $S \subseteq \{ (a,b) \mid a,b \in \Sigma^*\}$ .

Now we define a search problem for $S$ :

Given $a$ , find a $b$ such that $(a,b) \in S$ .

and a decision problem for $S$ :

Given $(a,b)$ answer whether or not $(a,b) \in S$ .

(For instance, in the example given in the question, $S$ will hold all the pairs $(u,v,k)$ such that there exists a path between $u$ and $v$ which is shorter than $k$.)

Note that these two problems are well defined. For this definition, we can ask whether the two problems are "equivalent" for any $S$. By "equivalent" I mean that if one of them is computable (i.e., there exists an algorithm that solves it) then the other one is computable as well. In general, they are not.

Claim 1: Decision implies Search.

Proof: Let $D_S$ be the algorithm that solves the decision problem of $S$. Given an input $a$, we can run $D_S(a,x)$ for every $x\in \Sigma^*$, one after the other, or in parallel. If there exists $b$ such that $(a,b)\in S$, we will eventually find it. If not, the algorithm might not stop $^\dagger$.
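The enumeration in this proof can be sketched as follows. This is a minimal Python sketch, assuming the decision algorithm $D_S$ is available as a callable `decide(a, b)` that always halts; the binary alphabet and the toy relation used in the usage note are illustrative assumptions.

```python
from itertools import count, product


def all_words(alphabet):
    """Yield every word over `alphabet`, shortest first (never terminates)."""
    for n in count():
        for letters in product(alphabet, repeat=n):
            yield "".join(letters)


def search_via_decision(a, decide, alphabet="01"):
    """Return some b with decide(a, b) true.

    Tries the candidates one after the other; if no suitable b exists,
    this loop runs forever, exactly as the proof warns.
    """
    for b in all_words(alphabet):
        if decide(a, b):
            return b
```

For example, with the toy relation "$b$ is the reversal of $a$", the call `search_via_decision("0011", lambda a, b: b == a[::-1])` returns `"1100"` after trying all shorter words first. Running the candidates sequentially suffices here because each call to the decision algorithm halts.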

Claim 2: Search does not imply Decision.

The reason is that the search algorithm might return a different $b$ than the one we need. That is, for every $a$ there is some $b$ that is very easy to find, but another $b'$ that is not. For instance, let $L$ be some undecidable language, then define

$S = \{ (x,0) \mid x\in \Sigma^*\} \cup \{ (x,1) \mid x \in L\}.$

For every $x$ the search algorithm can return $0$. But no decision algorithm can answer correctly whether $(x,1) \in S$, for all the pairs $(x,1)$. If it could, it would have decided an undecidable problem, which is impossible.

$^\dagger$ This depends on $S$. If, for instance, $S$ is bounded, there might exist an algorithm that does stop.

— Ran G.

The right decision problem is existence of $b$ s.t. $\langle a,b \rangle \in S$.

— Kaveh

If decision is defined as the existence of $b$, then search implies decision.

— Ran G.

In a weak sense, i.e. w.r.t. computability; complexity is a more delicate issue.

— Kaveh