134x Filetype PDF File size 0.40 MB Source: users.cecs.anu.edu.au
Efficient Reduction of L-infinity Geometry Problems HongdongLi Research School of Information Sciences and Engineering Australian National University and NICTA hongdong.li@anu.edu.au Abstract (binary search) procedure. But, convexoptimization is any- wayanexpensivecomputation. Modern convexoptimizers This paper presents a new method for computing optimal L are mostly based on the Interior-Point-Algorithm, which ∞ solutions for vision geometry problems, particularly for those is known to be polynomial algorithm, meaning that the problems of fixed-dimension and of large-scale. Our strategy for required computation grows polynomially in the size of the solving a large L problemistoreduceittoafinitesetofsmallest problem (i.e. number of constraints and variables). A high ∞ possible subproblems. By using the fact that many of the problems polynomial degree often prevents it from handling too big in question are pseudoconvex, we prove that such a reduction is problems,or its speed is unbearably slow. possible. To actually solve these small subproblems efficiently, we propose a direct approach which makes no use of any convex op- To accelerate the L∞ computation, researchers have ex- timizer (e.g. SOCP or LP), but is based on a simple local Newton ploredvariousways. However,mostofthemhavebeencon- method. We give both theoretic justification and experimental val- centrated either on how to speedup the convex solver itself, idation to the new method. Potentially, our new method can be or on how to reduce the total number of convex program madeextremely fast. iterations (e.g.,[5][2]). As a consequence, despite having all those improve- ments, their methods still have to solve a full-size convex 1. Introduction program using a full-scale convex optimizer at every itera- The L optimization is a rather new and promising di- tions. Since convexoptimizerhasstillpolynomialcomplex- ∞ ity, also may come with a high memory (storage) require- rection of research in multi-view geometry [4, 5, 6]. In just afewyears’timeithasattractedmuchattentionfromthevi- ment, it may not be capable of handling very large-scale sion geometrycommunity(seee.g.[12,17,9,14,2,7,15]). problems. AcriticalvirtueoftheL schemeisthatthesolutionob- The goal of this paper is to develop a new method to ∞ speed up the L computation, particularly for those large- tained is not only geometrically meaningful, but also glob- ∞ ally optimal and hence unique. The unique solution can al- scale applications. We will present a new approach that ef- waysbereliablyfound,regardlessofthesizeoftheproblem fectively solves an L∞ problemwithout solvinganyconvex and the configurations of the cameras. This forms a sharp program. Ourmethodisnotonlytheoreticallyprovable,but contrast to the conventional L method, which is known to also practically efficient. Potentially, it can be easily made 2 extremely fast. be problematic due to local minima or slow convergency. This virtue entails the L∞ scheme particularly suitable 2. Related work for large-scale applications. In these applications, there are often e.g. thousands of views to be triangulated, or thou- In an early work on L∞ vision computation [5], Kahl sands of points to be fit to a model. Paper [19] gives some formulated the problem as quasi-convex program, and pro- instancesoflargeproblems,oneofwhich,the‘NotreDame’ posed an Improved Bisection method based on the idea of data set, involves solving for 595 cameras and 277,887 3D updating the upper-bound adaptively, thus effectively re- points. duces the total number of SOCP iterations. This way, con- However,to process large-scaledata using L∞ there is a siderable time reduction was observed. computationalcomplexity issue. The standard approach for Arecentpaper[2],alsoaimingatreducingthetotalnum- solving an L∞ problem is to convert it into a sequence of ber of iterations, has gained remarkable success. Based convexprograms(e.g. SOCP, second-order cone program), on fractional programming, it introduced five algorithms and iteratively solves many such SOCPs via a bisection (e.g. Dinkelbach algorithm, Gugat algorithm, etc.) to vi- 1 sion community. The common idea is: instead of doing Our method follows from the reduction (or decompo- a naive binary search (bisection) which has linear conver- sition) strategy, which performs by reducing a big original gence, it uses Newton method to cut-downthe total number probleminto a set of small-size subproblems. We call these of SOCPs significantly. small subproblems as atom or primitive problems. Each of Olsson et al. [12] realized that many L∞ functions in the atomproblemsissosmallthatapplyingafull-scalecon- vision are in fact pseudoconvex (which is slightly stronger vex solver is often unnecessary. Rather, more direct and than quasi-convex). They proposed two fast algorithms. more efficient methods (e.g. analytic) may exist and may The first one is based on local search using LOQO, but sufficeforsolvingthem. Thisway,weavoidtheuseofcon- for large applications it suffers from numerical convergence vexoptimizer. problem. The second one is via SOCP approximation, and To be able to apply the above strategy, it is crucial to lately was shown to be a special case of Dinkelbach algo- show that the L∞ problems are indeed “reducible”. To rithm. Our work is significantly motivated by the concept this end, we will establish our main reduction theorem of pseudoconvexity. in section-5. The theorem is substantially grounded on Thenewmethod,to bepresentedin this paper, is rooted the pseudoconvexity theory–which will be reviewed in fromthemathematicaltheoryofLP-typeproblems. Inour section-4. previous work we introduced the elegant LP-type frame- To actually perform the reduction (i.e. form the origi- work to multi-view L∞ computation [7]. We have proven nally big problem to a set of small primitive problems), we that many fixed-dimensionalL∞ geometryproblemsare in present two reduction algorithms in section-7. These al- fact instances of the so-called LP-type problem [10]. How- gorithmsareefficient, inthe sense thatthey will find the op- ever, that paper’s focus was on outliers-removal, and no ef- timal solution in linear expected time. Therefore, our new ficient algorithm was given there. methodis also efficient. After the completion of the work, a very interesting con- Ourefficiencyargumentalso dependsonthe assumption nection was brought to our attention, which is that our that, one can solve those primitive problems extremely ef- method bears a remarkable similarity to the SMO method ficiently. This is often the case, in fact, as the primitive for fast Support Vector Machine training in the machine problems are usually very tiny and very regular. Section-6 learning field [13]. We note that, while these two meth- is devoted to primitive solvers. ods advocate the same idea of reducing to the smallest pos- Indeed, as an example, later we will show a 1000-point sible subproblems, the SMO was designed for the convex L∞planarhomographyproblemcanbereducedto 4-point (quadratic programming) case, and the actual reduction al- homographyand 5-point homographyproblems etc. Obvi- gorithms are also different. ously, to estimate an L∞ homography from 4 points one does not need any sophisticated SOCP; a simple SVD suf- 3. A Preview of the Paper fices. So far, almost all published works on L∞ vision com- 4. The L Minimization and Pseudoconvexity ∞ putation are based on convex optimization. As mentioned earlier, being a polynomial-complexity algorithm, the con- To ease exposition, this section will summarize some vex optimization is expensive, preventing it from handling known results of L∞ in vision computation. Emphasis is very big problems. given to pseudoconvexityand its implications. For this reason, most existing wisdom for accelerating The key idea of the L∞ scheme is to replace the L2 er- large-scale L computation is either through using faster ror norm with the L∞-norm (i.e. minimax norm). Previous ∞ convex solvers, or through reducing the total number of worksshowthatthisleadstoquasi-convexminimization(or convexprograms. quasi-convex program). We call a function quasi-convex if all of its sublevel sets are convex. Quasi-convex minimiza- tion in multi-view geometry often takes the following form However, since convex-optimization appears to be the ([5]): bottleneck here, can we avoid it at all? Motivated by this question, we take a very different angle to attack the prob- kA x+b k min max f (x) = i i (1) i T lem of large-scale L∞ computation. We ask ourselves a x i (c x+di) i novel question: is it possible to effectively solve an L T ∞ s:t: c x+di >0;i=1;:::;N; (2) i problemwithout solving any convex program ? Our answer to this question is “yes.” In this paper, we wherethefi(x) are quasi-convex,x ∈ Rn is the unknowns propose a new method precisely does this: enjoying all the to be solved for. The dimension of the problem is n, which benefitsofL∞withoutusingany(slowandexpensive)con- is often fixed and intrinsic to particular application. For ex- vexoptimizer. ample, n = 3 for multi-view triangulation, n = 6 for 2D affinity, n = 8 for planar homography, n = 7 for funda- This theorem says that, unlike the quasi-convex case, for mental matrix and n = 11 for camera calibration, etc. The pseudoconvexcase the KKT conditionsare not only neces- size (scale) of the problemis definedby N. By ‘large-scale’ sary but also sufficient. This actually suggests a local de- problemswemeanN ≫ningeneral. scent approach for pseudoconvex optimization. Following Thestandardapproachtosolvesuchaquasi-convexpro- from this, paper [12] made an important attempt in using gram is to convert it to iteratively solving the following LOQOtosolve L∞ problems. For small problems LOQO SOCPsviabisection: worked fine, but for large problems it often failed to con- verge. min γ (3) x T s:t: C (x) = kA x+b k−γ(c x+d )≤0;i=1;:::;N; i i i i i Minimax case. Recall that our purpose is to minimize max f (x); where each f (x) is pseudoconvex. Although where C (x) represents the i-th second-order cone (shell i i i i the concept of pseudoconvexitydoes not extend to the min- only). Note that we have multiplied the chirality condition imaxcase, we howeverhavea significant result as follows: to both side. ∗ ∗ Theorem4.4(minimaxKKT) x solves γ = 4.1. Useful results of pseudoconvexity min max f (x); i = 1;:::;N; where f (x) is pseudocon- x i i i vex, if and only if there exist scalars λ such that Paper [12] has proven that the above functions f (x) are i i in fact pseudoconvex in the feasible region. Pseudoconvex N N is a slightly stronger condition than quasi-convex. A pseu- Xλ▽f(x∗)=0; Xλ =1; (6) i i i doconvexfunction is always quasi-convex but the converse i=1 i=1 is not necessarily true. where λ ≥ 0if f (x∗) = γ∗;λ = 0 if f (x∗) < γ∗: i i i i Definition 4.1 A function f is called pseudoconvex if it is ∗ This theorem says that: at the optimum point x of γ, in ¯ ¯ ¯ differentiable and ▽f(x)(x−x) ≥ 0implies f(x) ≥ f(x), each direction (denoted by θ) there must be an ‘i’ such that where ▽ denotes gradient operator [8]. ∗ ▽fi(x )· θ ≥0. Geometrically, this means that in each di- rection there is at least one residual non-decreasing. Proofs Pseudoconvexity is very useful in minimization. Infor- of all the above results can be found in [8, 12]. mally speaking, a local stationary point of a pseudoconvex functionis also its global minimum. To find the global min- 5. Main Reduction Theorem imumitissufficienttofindalocalstationarypoint,andthis can be done sufficiently by solving a KKT system of the We have seen that pseudoconvexity is useful in deriv- problem,as assured by the following two results. ing optimization procedures. In this section, we will further showthatit actually offers more. Lemma4.2 Given a pseudoconvex function f, then we Recall that our intended strategy for solving a big prob- ¯ ¯ have▽f(x) = 0 if and only if f(x) ≥ f(x) for all x. lemofsize N(c.f. Eq.3)is to reduce it to a single or a set of This lemma says that any local stationary point of a pseu- small primitive problems, each of which is of size k ≪N. doconvexfunction is also its global minimum. Byusingpseudoconvexity,we will show this is indeed pos- sible, thanks to the Main Reduction Theorem to be given Theorem4.3(KKTsufficientcondition[8]) Consider a below. general inequality-constrainedminimization problem: Before presenting the theorem, we first point to a useful empirical observation, which offers insight to the theorem. minf(x); s:t: gi(x) ≤ 0;i = 1;:::;N (4) Many authors have observed that, the L∞ optimum is x actually only supported by a small subset of the entire x inaconvex set: constraints set. This empirical ‘fact’ was applied to L ∞ ¯ computations (e.g. [18, 15, 7]). But neither of them has Let x be a feasible solution. Suppose its KKT system holds true at x¯, i.e., there exist scalars λ ≥ 0 such that offeredsufficientjustification, nor answered a key question: i “how big size that subset should be, and how to quickly ▽f(x¯)+Xλi▽gi(x¯) = 0: (5) findit?”. i Suppose further that f(x) and gi(x) are pseudoconvex, 5.1. The Main Theorem then the point x¯ which solves the KKT system is precisely the global optimum. Wenowstatethemainreductiontheorem. Theorem5.1(mainreductiontheorem) Suppose by using our reduction algorithms (to be given Consider L problem min max f (x), where x ∈ in the next section) we have reduced a big N-view problem ∞ x i i Rn; i = 1;:::;N, N > (n + 1), each fi(x) is pseudocon- into a small set of k-view (k ≤ 4) primitive problems. Be- vex. Denote f (x) = max f (x); where I is a subset of cause the primitive problems are so small that one is able I i∈I i N = {1;:::;N}: Also denote f∗ as the minimum of f (x): to solve them directly, efficiently, or even in closed-form. I I Then there must exist a proper subset B which is of size Nowletusshowthis. ∗ ∗ ∗ ∗ |B| ≤ n + 1 such that f = f and x = x are their B N B N 2-viewcase. It is easy to imagine(viageometricintuition) (equal) optimizers. that, solving 2-view L triangulation is equivalent to find- ∞ We call a subset of N a basis set (or basis in short), if ingtwonon-inclusiveconesthataretangentin3-space. The optimum attains, i.e., the γ values is minimal, when the this subset yields the same minimal function value and two cones are tangent with the two gradients at the point if remove any member from it will decrease the function of tangency pointing to opposite directions. This point-of- value. The maximal size (cardinality) of any basis is called tangency gives the optimal solution. Summarizing this up the combinatorial dimension. Hence the combinatorial mathematically and we get: dimensionin the theorem is (n + 1). ( T C (x) = kA x+b k−γ (c x+d )=0; i =1;2 i i i i i Aproof of the theorem is given in the Appendix of the λ▽C1(x)+(1−λ)▽C2(x)=0; λ>0;ascalar: paper. It is adduced in a form most suitable for the purpose. This is almost a system of equations. Having very small Arelated and neat result, though in a different context, can size, the system can be easily solved by any Newtonmethod be found in [11]. (e.g. Levenberg-Marquardt). In solving the system, the last positivity conditioncanberelaxedfirst andreinforcedafter- This theorem predicts that the solution of the big prob- ward. Alineartriangulationmethodmaybeusedtoprovide lem is dominated by a small basis set of at most (n + 1) aninitial point. To ensurethat the linear solution is also fea- constraints. This is very appealing, because solving that sible in terms of chirality, a validation before using it seems small sub-problem yields identically the same result to the necessary. original problem. Now the remaining task is to quickly find ThereadermayworrythatthelocalNewtonmethodmay abasis. not converge globally. The fact is, being pseudoconvex the If the purposeweremerelytofindabasis,thenonecould problem has only one local minimum, which is also the easily do it a posteriori, meaning that extracting a basis set global optimum. In addition, because the problem size is so after having found a solution of the original problem. But tiny and the nonlinearity is so mild that the Newton solver this is meaningless however, for our current purpose of re- is adequate in practice. Our tests hardly encountered any duction. We must have an a priori approach that can iden- difficulties, even when the noise level was high. tify a basis quickly before spending too much time on solv- ing the original big problem. But how ? 3-view case. The 3-view case is a bit tricky. Now the 3 Our answer to this will be given in section-7, in which cones cannot be tangent in any configuration, otherwise it wewillpresenttworeductionalgorithms,bothofwhichcan will reduce to the 2-view case, contradict to the fact that the 3 cones form a basis set. Hence, they must intersect at findabasisinlinearexpectedtime. In the nextsection (i.e., a common point. Moreover, at that point the three gradi- section-6), instead, we will explain how to construct primi- ents must cover all feasible directions, otherwise the point tive solvers. would move therefore not non-optimal, contradict to the preassumption. Mathematically we have: 6. Solving Primitive Problems 8 T >C(x)=kAx+bk−γ(c x+d)=0; i=1;:::;3 i i i i i Inthis section, we will elaborateonhowtoactuallycom- < Xλi▽Ci(x)=0; Xλi=1; λi >0: pute the primitive problems fast and efficiently, without us- > ing convex optimizer. We choose to describe our primitive : i i solvers earlier (before describing the actual reduction algo- 4-view case. The 4-view case, on the other hand, is par- rithms in the next section), is to assure the reader earlier of ticularly simple. Note that we have totally 4 unknowns to the possibility of solving L∞ without SOCP. solve, 3 in x and 1 in γ. Now we have precisely 4 cones in Let us use N-view triangulation as an example to il- the basis set. Hence the solution can be found as a proper lustrate the possibility to construct ad hoc primitive solvers root of the square system of equalities. without SOCP. The combinatorial dimension here is 4. T C (x) = kA x+b k−γ (c x+d )= 0; i = 1;:::;4 Hence only 4 distinct primitive problems exist: 1-view, 2- i i i i i view, 3-view and 4-view (the 1-view case is trivial). Note There may be multiple roots. However, this is no serious, that they are all under the L∞ norm. thanks to the use of local solver, and to the fact that we
no reviews yet
Please Login to review.