Geometric Methods

论文阅读 Contact-Rich SE(3)-Equivariant Robot Manipulation Task Learning via Geometric Impedance Control

延续上一个Geometric Impedance Control（GIC）的文章内容，由于本文的跟踪的目标是静止的（固定的孔），所以自然$V_d = 0$，则公式变为：$\tilde{T} = \tilde{G} - f_g -K_d e_V = \tilde{G} - f_g -K_d V^b$ 如作者所述，这实际上就是PD控制+重力补偿。其中P项是$f_g$，而D项是$K_d V^b$

之前的工作里的$e_G$在这里叫做GCEV（geometrically consistent error vector）：

$$
e_G = \begin{pmatrix}
e_p \\ e_R
\end{pmatrix} = \begin{pmatrix}
R^\top (p-p_d) \\ (R_d ^\top R - R^\top R_d)^\vee
\end{pmatrix}
$$

然后，作者把普通的阻抗控制律叫做CIC（Cartesian Impedance Control）普通的误差向量叫做CEV

形式化

总体的框架是：变增益的阻抗控制，要调的是$K_p,K_R$。至于怎么调增益，基于的是当前观测到的误差向量（可以选用GCEV还是CEV训练这个调增益的网络），至于怎么把力矩输入到系统，用的是阻抗控制律（可以选用GIC或者CIC）

由状态到action $a_t$的映射可以表示为$s_t$表示为：$a_t = \mu_\theta(s_t)$，而我们再把action经过一个$h$映射得到增益，因此整体可以写为$(K_p, K_R)_t = h(a_t) = h(\mu_\theta(s_t))=\pi_\theta(s_t)$

这个状态是指啥呢？状态就是观测到的误差向量，可以选普通的笛卡尔误差向量，也可以选GCEV

分析

GCEV作为网络输入，确保gain scheduling的方法是左不变的，即，只要轴孔相对位姿不变，增益调节方法一定是一样的

GIC的作用是，让力在本体坐标中表示（而非世界系下），这对于轴孔插拔的例子是正确的做法，因为孔斜过来后力也应该斜过来，直接用自身坐标表示的话就自动能做到这一点

总结

If a force-based policy is left-invariant in the body-frame, e.g., implemented using a neural network with left-invariant features as input, and described in the end-effector body frame, it will be guaranteed to be left-equivariant in the spatial frame

Geometric Methods - Variable Impedance Control

Geometric Methods

论文阅读 Contact-Rich SE(3)-Equivariant Robot Manipulation Task Learning via Geometric Impedance Control

形式化

分析

总结