qphase v1.0

This is the solving algorithm powering Cuboth, the world's fastest robot (as of December 2020) to solve an unmodified Rubik's Cube. In general, it is a heavily enhanced version of my own rob-twophase solver, designed to fully account for Cuboth's complex quad-arm mechanics while searching for solutions. This leads to extremely efficient solutions of just 19 robot moves (already counting all tilts and non-parallel regrips) within milliseconds and is one of the cornerstone improvements over the previous unmodified cube robot record.

NOTE: The primary purpose of this repository is to document the various new techniques that were necessary to adapt Herbert Kociemba's two-phase algorithm for finding solutions particularly efficient with respect to advanced quad-arm robot mechanics (the code here is however of course fully executable). If you are just looking for a very efficient off-the-shelf solver to use in your own projects, you should probably be looking at rob-twophase (unless you are working on a robot very similar to Cuboth).

Now follows a rather detailed description of the main ideas behind the qphase algorithm. Note that the following text assumes good familiarity with Kociemba's two-phase algorithm (see for example Kociemba's website or Tomas Rokicki's extensively documented implementation in cube20src).

General Considerations

A simple technique for finding a solution that is not just short in terms of the number of moves but also efficient to physically execute with a robot is to generate several/many solutions (rather than just a single one) and then select the fastest one. While this sometimes already gives reasonably good results, one can generally do quite a bit better by considering the robot mechanics directly within the search. However, depending on the type of mechanics to consider, this can become very challenging to do.

My previous solving algorithm rob-twophase could, for example, search directly in the axial metric (AX), where two moves on opposite faces count as one (because they can be executed in parallel by an axial robot), or in the axial quarter-turn metric (AXQT), where additionally half-turns count as 2 moves. While doing this was certainly already quite tricky to implement, the corresponding extensions of the standard two-phase algorithm are conceptually rather straightforward. Basically, one just has to modify the move set available to the solver. Efficiently considering proper quad-arm robot mechanics is a lot more challenging and requires quite a few new techniques.
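To make the axial metric concrete, here is a minimal sketch that counts the AX length of a move sequence by merging adjacent turns of opposite faces. The function name and the string notation are assumptions for illustration and are not taken from rob-twophase.

```python
# Opposite faces share an axis and can be turned in parallel by an axial robot.
AXIS = {"U": 0, "D": 0, "R": 1, "L": 1, "F": 2, "B": 2}

def axial_length(solution):
    """Count moves in the axial metric: two adjacent turns of opposite
    faces are merged into a single axial move."""
    count = 0
    prev_face = None  # face of the last move if it can still absorb a partner
    for move in solution.split():
        face = move[0]
        if prev_face is not None and face != prev_face and AXIS[face] == AXIS[prev_face]:
            prev_face = None  # merged into the previous axial move
        else:
            count += 1
            prev_face = face
    return count

print(axial_length("R L U D2 F"))  # 3 axial moves instead of 5 face turns
```

The sketch only measures a given sequence. Searching in AX, as described above, means offering such merged moves to the solver directly.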

Still, the basis of the qphase algorithm is rob-twophase and since a quad-arm robot can also do parallel turns, we want to be searching in either AX or AXQT. While my previous robot SquidCuber used the AXQT mode (since a half-turn took almost twice as long as a quarter-turn), we use AX here. This allows us to avoid any complications caused by the quarter-turn metric and Kociemba's two-phase algorithm is also generally more efficient in half-turn metrics. Further, half-turns are actually not that much slower to execute for quad-arm robots because they generally do not require an immediate ungrip action (which of course costs extra time and is almost always needed after quarter-turns).

Tilts

The biggest mechanical restriction of a quad-arm robot is that it cannot directly turn the top and bottom faces of the cube, meaning that it generally has to perform several cube rotations throughout a solve. We call those tilts. Since standard solutions may require a lot of tilts, we want to encode this information directly into the pruning tables.

There are 24 different ways to hold a cube. Naively augmenting the standard phase 1/2 pruning tables with this TILT coordinate would result in a 24x memory increase. This is problematic as the original tables already require > 1 GB of RAM (with an axial version of Rokicki's extended phase 1 table). Fortunately, TILT is highly symmetric. As long as the axis that cannot currently be turned is the same, we can always symmetry transform (with respect to robot symmetry) a solution with tilt1 into one for the same cube in tilt2. Therefore, we can symmetry reduce TILT to only 3 different equivalence classes and consequently achieve an 8x memory reduction. Actually implementing this is, however, quite tricky because the standard Kociemba algorithm already utilizes cube symmetry, which is not independent of the cube's TILT as a symmetry transformation can change the axis that cannot currently be turned. Thus, to perform a double symmetry reduction we first have to reduce the current state with respect to cube symmetry, then adjust the TILT if necessary (via conjugation) to get an actually equivalent state, and finally symmetry reduce the TILT with respect to robot symmetry. Overall, this makes for a very efficient (both in memory and in runtime) method for fully considering cube rotations during the search.
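The counting behind the 8x reduction can be checked with a small sketch. It enumerates the 24 cube orientations and groups them by the axis that is currently vertical (the one that cannot be turned). The representation of an orientation as an (up, front) pair is an assumption made here for illustration. The sketch does not cover the interplay with cube symmetry described above.

```python
from collections import Counter
from itertools import product

FACES = "UDRLFB"
AXIS = {"U": 0, "D": 0, "R": 1, "L": 1, "F": 2, "B": 2}

# An orientation is fixed by which face points up and which one faces the front;
# the two must lie on different axes, giving 6 * 4 = 24 orientations.
orientations = [(up, front) for up, front in product(FACES, FACES)
                if AXIS[up] != AXIS[front]]

def tilt_class(orientation):
    """Equivalence class of an orientation: the axis that cannot be turned."""
    up, _front = orientation
    return AXIS[up]

class_sizes = Counter(tilt_class(o) for o in orientations)
print(len(orientations), dict(class_sizes))  # 24 {0: 8, 1: 8, 2: 8}
```

Each of the 3 classes contains 8 orientations, which is where the factor of 8 comes from.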

Gripper States

The inability to turn the top and bottom face is not the only mechanical restriction of a quad-arm robot. The grippers may also block each other, e.g. after a move like (R L) executed in neutral position, which then requires a full regrip, i.e. a regrip that cannot be executed in parallel to the next move, thus increasing the length of the solution. While this is certainly an issue, it turns out that most of those situations can be avoided with smart (i.e. potentially preemptive) parallel regripping. This means that the tilt-augmented pruning tables provide good enough lower bounds, which is very fortunate as further coding in gripper information would lead to a dramatic increase in memory consumption.

Handling the gripping dynamics during the search is not at all straightforward because regripping decisions may have long-term consequences. Consider for example the move sequence F2 R B R B (executed in an initially neutral gripper configuration). Depending on whether or not we regrip R in parallel to F2, we end with either the R or the B gripper in horizontal position 4 moves later. If we follow up with tRL F, the former situation is preferable, and for tFB L the latter. This means that if we guess a regripping maneuver at F2 in the search, we may have to backtrack at a later point and fix up the regrips, which is of course very inefficient. Alternatively, we could simply branch over both options. However, in many situations (e.g. if we follow up the previous example with L F) it does not matter which way we regripped many moves ago. Hence, we would potentially be considering many redundant branches, thereby dramatically slowing down the search again.

Fortunately, we can solve all of these issues by taking inspiration from automata theory (as is for example used for regular expression parsing). During the search we don't just keep track of the current gripper state g_i but maintain a set of possible gripper states S_i. More precisely, a gripper state g is in S_i if there exists a way to regrip during the first i moves of our solution such that we end up in g. If none of our next candidate moves is possible in any of the gripper states in S_i, then the next move must be a full regrip. As a small optimization, we defer full regrips for as long as possible, which may lose us a few good solutions in some rare cases but generally speeds up the search significantly. Note that we can encode the set of gripper states with an integer between 0 and 127 (any subset of 7 different gripper states) and that we can "easily" (well, with quite a bit of care at least) precompute all transition dynamics. This technique allows us to correctly handle the gripper mechanics very efficiently in an implicit manner without worrying about the actual regripping sequence while searching for solutions.
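The set-of-states idea can be sketched as follows. The single-state dynamics in `successors` and the move count are made up purely for illustration and are not Cuboth's real gripper mechanics. Only the bitmask encoding of subsets of 7 states and the precomputed transition table follow the description above.

```python
N_STATES = 7   # gripper states, so a set of states fits into 7 bits (0..127)
N_MOVES = 4    # toy move count, not the real move set

def successors(g, move):
    """Hypothetical single-state dynamics: states reachable from g when
    executing `move`, possibly with a parallel regrip. Empty if blocked."""
    if (g + move) % 3 == 0:
        return []
    return [(g + move) % N_STATES, (g + 2 * move + 1) % N_STATES]

# trans[mask][move] is the bitmask of all states reachable from any state in mask.
trans = [[0] * N_MOVES for _ in range(1 << N_STATES)]
for mask in range(1 << N_STATES):
    for move in range(N_MOVES):
        out = 0
        for g in range(N_STATES):
            if (mask >> g) & 1:
                for h in successors(g, move):
                    out |= 1 << h
        trans[mask][move] = out

def run(moves, start_mask=1):
    """Track the set of possible gripper states along a move sequence.
    Returns None as soon as a move is impossible in every tracked state,
    i.e. when a full regrip would be required first."""
    mask = start_mask
    for m in moves:
        mask = trans[mask][m]
        if mask == 0:
            return None
    return mask
```

During the search, a single table lookup per move then replaces any explicit branching over regripping decisions.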

Inverse Search

There is one more detail of the standard two-phase algorithm that causes quite a lot of problems for searching directly in the quad-arm dynamics: the 3 search directions. Running the standard Kociemba algorithm on certain transformed versions of the initial state simulates backwards searches, which considerably increases the chance of finding short solutions. However, such a solution must then be inverted to be applicable to the original cube. Normally, a quad-arm robot must start in the neutral gripper configuration but may end in an arbitrary one. The inverse of such a solution generally does not start in the neutral state. Further, the gripper dynamics are not necessarily invertible. Thus, we also have to precompute inverse tables and ensure that the inverse search ends in the neutral state (but may start in an arbitrary one).

Unfortunately, there is also a more profound problem caused by the fact that a search must start in a certain TILT class but may end in an arbitrary one. This means that, depending on the current search direction (i.e. the initial assignment of the UD-axis), our search must end in a certain TILT class. Consequently, we need 3 different phase 2 tables (as it is very important to reject bad phase 1 solutions immediately before doing any phase 2 searches). Luckily, since the table entries may differ by at most 1 (one final tilt is sufficient to go from any TILT class into any other), we can store everything in a single table with just 3 extra bits per entry.
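One possible way to realize the "3 extra bits" is sketched below: store the minimum of the three values once and add one flag bit per TILT class that marks whether its value is one larger. This bit layout is an assumption for illustration; the actual layout in qphase may differ.

```python
def pack(values):
    """Pack three pruning values that differ by at most 1 into one integer:
    the shared minimum in the high bits plus one flag bit per TILT class."""
    base = min(values)
    assert max(values) - base <= 1
    flags = 0
    for k, v in enumerate(values):
        if v > base:
            flags |= 1 << k
    return (base << 3) | flags

def unpack(entry, k):
    """Recover the pruning value for target TILT class k (0, 1 or 2)."""
    return (entry >> 3) + ((entry >> k) & 1)

entry = pack((7, 8, 7))
print([unpack(entry, k) for k in range(3)])  # [7, 8, 7]
```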

After going through all of those hassles, qphase can indeed also utilize inverse searches to find very short solutions even faster.

Optimal Regrips

Previously we discussed that the qphase algorithm considers regrips only implicitly while searching for a solution. This means a returned solution is guaranteed to be executable without any extra full regrips, but it does not tell us how to achieve that. An explicit parallel regripping schedule can be figured out efficiently via dynamic programming. However, we can take things one step further and find not just any schedule but the "best" one according to some criterion (there are usually several possible ways of executing the same solution). Here we compute the schedule that is safest to execute, i.e. the one where the most moves are supported by at least 2 stationary grippers (there is generally a higher chance of something going wrong when a move is only supported by a single stationary gripper). While this is certainly not the most critical optimization, it is essentially free from a computational standpoint, so we might as well take it.
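A dynamic program of this kind might look like the following Viterbi-style sketch. The `options` callback, the two-state toy dynamics and the `safe` flag (standing in for "supported by at least 2 stationary grippers") are all hypothetical and only illustrate the structure of the computation.

```python
def best_schedule(moves, options, start=0):
    """Pick one gripper state per move so that the number of 'safe' moves
    is maximal. `options(g, move)` returns (next_state, safe) pairs."""
    best = {start: 0}   # state -> max number of safe moves so far
    back = []           # per step: next_state -> chosen previous state
    for move in moves:
        nxt, prev = {}, {}
        for g, score in best.items():
            for h, safe in options(g, move):
                cand = score + int(safe)
                if h not in nxt or cand > nxt[h]:
                    nxt[h], prev[h] = cand, g
        if not nxt:
            return None  # not executable without a full regrip
        best = nxt
        back.append(prev)
    # Reconstruct the state sequence from the best final state.
    state = max(best, key=best.get)
    score = best[state]
    path = [state]
    for prev in reversed(back):
        state = prev[state]
        path.append(state)
    return score, path[::-1]

def toy_options(g, move):
    """Hypothetical dynamics with two gripper states 0 and 1."""
    if move == "R" and g == 0:
        return [(1, False)]               # R forces a regrip out of state 0
    return [(g, g == 1), (1 - g, False)]  # stay (safe only in state 1) or toggle

print(best_schedule(["F", "R", "B"], toy_options))  # (2, [0, 1, 1, 1])
```

In the toy run, regripping preemptively during F pays off because it makes both following moves safe.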

Finally, qphase returns multiple solutions allowing further small timesaves by rating them (before selecting the best one) according to robot intricacies (e.g. reliably executing some specific moves may require some extra delay, etc.) that cannot directly be integrated (or would not be worth integrating) into the search process. For Cuboth this saves an additional ~50 milliseconds on average.
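Such a final rating step amounts to a simple minimum over a cost estimate, as in this sketch. The timing constants and the set of moves needing extra delay are invented placeholders; real values depend entirely on the robot.

```python
# Hypothetical per-move costs in milliseconds, for illustration only.
BASE_MS = 10
EXTRA_DELAY_MS = {"F2": 3, "B2": 3}  # assumed: moves needing extra settling time

def estimated_time(solution):
    """Estimated execution time of a solution given as a move string."""
    return sum(BASE_MS + EXTRA_DELAY_MS.get(m, 0) for m in solution.split())

def pick_best(solutions):
    """Select the candidate with the lowest estimated execution time."""
    return min(solutions, key=estimated_time)

candidates = ["R F2 L B2", "R F L B2", "R F L B"]
print(pick_best(candidates))  # R F L B
```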

efrantar / qphase
