第六章：线性系统的反馈与综合

一、引言

在实际控制问题中，我们往往希望系统不仅稳定，而且具有期望的动态性能，例如响应速度快、超调小等。通过引入反馈，我们可以改变系统的动态特性，使其满足设计要求。

本章主要讨论两种反馈控制策略：

状态反馈 (State Feedback)：假设系统的所有状态变量都可以直接测量，并利用这些状态信息构成反馈控制律。
输出反馈 (Output Feedback)：当系统的状态变量不能完全直接测量时，利用可测量的输出信息构成反馈控制律。这通常需要设计状态观测器 (State Observer) 来估计不可测量的状态。

二、状态反馈与极点配置 (State Feedback and Pole Placement)

1. 状态反馈控制器

考虑线性定常系统： \[ \dot{\textbf{x}}(t) = A\textbf{x}(t) + B\textbf{u}(t) \] \[ \textbf{y}(t) = C\textbf{x}(t) + D\textbf{u}(t) \] 引入状态反馈控制律，形式为： \[ \textbf{u}(t) = \textbf{r}(t) - K\textbf{x}(t) \] 其中：

\(\textbf{r}(t)\) 是外部参考输入（指令信号）。
\(K\) 是 \(r \times n\) 维的状态反馈增益矩阵。
\(-K\textbf{x}(t)\) 是反馈部分。

[图片：带状态反馈的系统框图，如 `chapter6_Design_of_Feedback_Control_System.pdf` P7]

将控制律代入状态方程，得到闭环系统的状态方程： \[ \dot{\textbf{x}}(t) = A\textbf{x}(t) + B(\textbf{r}(t) - K\textbf{x}(t)) \] \[ \dot{\textbf{x}}(t) = (A-BK)\textbf{x}(t) + B\textbf{r}(t) \] 闭环系统的系统矩阵变为 \(A_{cl} = A-BK\)。闭环系统的输出方程为： \[ \textbf{y}(t) = C\textbf{x}(t) + D(\textbf{r}(t) - K\textbf{x}(t)) = (C-DK)\textbf{x}(t) + D\textbf{r}(t) \] (若 \(D=0\)，则 \(\textbf{y}(t) = C\textbf{x}(t)\))

核心思想：通过选择合适的反馈增益矩阵 \(K\)，可以改变闭环系统矩阵 \(A-BK\) 的特征值，从而改变闭环系统的极点位置，进而改善系统的动态性能和稳定性。

2. 状态反馈对系统性质的影响

能控性：状态反馈不改变系统的能控性。如果系统 \((A,B)\) 是能控的，则系统 \((A-BK, B)\) 也是能控的。 (证明思路：比较能控性矩阵 \(Q_c(A,B)\) 和 \(Q_c(A-BK,B)\) 的秩，可以证明两者相等。参考 chapter6_Design_of_Feedback_Control_System.pdf P8)
能观测性：状态反馈可能改变系统的能观测性。如果系统 \((A,C)\) 是能观测的，系统 \((A-BK, C)\) 不一定是能观测的。例题 (参考P9)： \(\dot{\textbf{x}} = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}\textbf{x} + \begin{bmatrix} 0 \\ 1 \end{bmatrix}u, y = \begin{bmatrix} 0 & 1 \end{bmatrix}\textbf{x}\) 原系统 \((A,B)\) 能控，\((A,C)\) 能观测。取状态反馈 \(K = \begin{bmatrix} 1 & 0 \end{bmatrix}\)，则 \(u = r - x_1\)。 \(A-BK = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} - \begin{bmatrix} 0 \\ 1 \end{bmatrix}\begin{bmatrix} 1 & 0 \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} - \begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}\)。闭环系统 \((A-BK, C)\) 的能观测性矩阵 \(Q_o = \begin{bmatrix} C \\ C(A-BK) \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}\)，秩为1，系统变为不能观测。这是因为状态反馈引入了零极点对消。原系统传递函数 \(G(s) = C(sI-A)^{-1}B = \frac{s}{s^2-1}\)。闭环系统传递函数(从r到y) \(G_K(s) = C(sI-(A-BK))^{-1}B = \frac{s}{s^2} = \frac{1}{s}\)。极点 \(s=0\) 和零点 \(s=0\) 对消。

3. 极点配置 (Pole Placement / Pole Assignment)

定理 (Wonham, 1967)：线性定常系统 \(\dot{\textbf{x}} = A\textbf{x} + B\textbf{u}\) 的闭环极点可以通过状态反馈 \(\textbf{u} = -K\textbf{x}\) (这里暂时不考虑参考输入 \(\textbf{r}\)) 任意配置（即闭环系统矩阵 \(A-BK\) 的特征值可以任意指定，只要复数特征值成共轭对出现）的充分必要条件是：系统 \((A,B)\) 完全能控。

设计目标：给定一组期望的闭环极点 \(\lambda_1^*, \lambda_2^*, \dots, \lambda_n^*\)，确定反馈增益矩阵 \(K\)，使得 \(A-BK\) 的特征值恰好是这些期望极点。

方法一：直接比较法 (适用于低阶系统)

设期望的闭环特征多项式为 \(\Delta_d(s) = (s-\lambda_1^*)(s-\lambda_2^*)\dots(s-\lambda_n^*) = s^n + \bar{\alpha}_{n-1}s^{n-1} + \dots + \bar{\alpha}_0\)。
计算闭环系统矩阵 \(A-BK\) (其中 \(K\) 的元素为待定系数)。
计算实际的闭环特征多项式 \(\Delta(s) = \det(sI-(A-BK))\)。
比较 \(\Delta_d(s)\) 和 \(\Delta(s)\) 的同次幂系数，建立关于 \(K\) 中元素的方程组，求解 \(K\)。

例题 (参考P12，SISO系统) \(A = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, B = \begin{bmatrix} 0 \\ 1 \end{bmatrix}\)。期望极点为 \(-1+j, -1-j\)。期望特征多项式 \(\Delta_d(s) = (s-(-1+j))(s-(-1-j)) = (s+1-j)(s+1+j) = (s+1)^2 - (j)^2 = s^2+2s+1+1 = s^2+2s+2\)。设 \(K = \begin{bmatrix} k_1 & k_2 \end{bmatrix}\)。 \(A-BK = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} - \begin{bmatrix} 0 \\ 1 \end{bmatrix}\begin{bmatrix} k_1 & k_2 \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ -k_1 & -k_2 \end{bmatrix}\)。实际特征多项式 \(\det(sI-(A-BK)) = \det \begin{bmatrix} s & -1 \\ k_1 & s+k_2 \end{bmatrix} = s(s+k_2) - (-1)k_1 = s^2+k_2s+k_1\)。比较系数：\(k_2=2, k_1=2\)。所以 \(K = \begin{bmatrix} 2 & 2 \end{bmatrix}\)。

方法二：基于可控标准型的变换法 (Ackermann公式的前身或等效方法)

(参考 chapter6_Design_of_Feedback_Control_System.pdf P13-P18，主要针对SISO系统) 如果系统 \((A,B)\) 能控，总可以找到一个非奇异变换矩阵 \(P_c\) (PPT中用 \(P^{-1}\) 表示能控性矩阵和系数阵的乘积，然后用 \(P\) 进行相似变换)，将系统 \((A,B)\) 变换为可控标准型 \((\bar{A}, \bar{B})\)： \(\bar{A} = P_c A P_c^{-1} = \begin{bmatrix} 0 & 1 & 0 & \dots & 0 \\ 0 & 0 & 1 & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \dots & 1 \\ -\alpha_0 & -\alpha_1 & -\alpha_2 & \dots & -\alpha_{n-1} \end{bmatrix}\) \(\bar{B} = P_c B = \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix}\) 其中 \(s^n + \alpha_{n-1}s^{n-1} + \dots + \alpha_0 = \det(sI-A)\) 是原系统的特征多项式。

设状态反馈为 \(\textbf{u} = -\bar{K}\bar{\textbf{x}}\)，则闭环系统矩阵为 \(\bar{A}-\bar{B}\bar{K}\)。 \(\bar{A}-\bar{B}\bar{K} = \begin{bmatrix} 0 & 1 & \dots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \dots & 1 \\ -\alpha_0-\bar{k}_1 & -\alpha_1-\bar{k}_2 & \dots & -\alpha_{n-1}-\bar{k}_n \end{bmatrix}\) (注意：PPT P14 的 \(\bar{A}\) 是另一种可控标准型，其最后一行是特征多项式系数，\(\bar{B}\) 第一个元素是1。这里采用与第二章一致的可控标准型。) 其特征多项式为 \(s^n + (\alpha_{n-1}+\bar{k}_n)s^{n-1} + \dots + (\alpha_0+\bar{k}_1)\)。设期望特征多项式为 \(s^n + \bar{\alpha}_{n-1}s^{n-1} + \dots + \bar{\alpha}_0\)。比较系数可得：\(\bar{k}_i = \bar{\alpha}_{i-1} - \alpha_{i-1}\) (for \(i=1, \dots, n\)) 则 \(\bar{K} = \begin{bmatrix} \bar{k}_1 & \bar{k}_2 & \dots & \bar{k}_n \end{bmatrix} = \begin{bmatrix} \bar{\alpha}_0-\alpha_0 & \bar{\alpha}_1-\alpha_1 & \dots & \bar{\alpha}_{n-1}-\alpha_{n-1} \end{bmatrix}\)。原始系统的反馈增益为 \(K = \bar{K}P_c\)。

变换矩阵 \(P_c\) 的构造 (参考PPT P13，但符号可能不同)：设原系统特征多项式 \(\det(sI-A) = s^n + \alpha_{n-1}s^{n-1} + \dots + \alpha_0\)。能控性矩阵 \(Q_c = \begin{bmatrix} B & AB & \dots & A^{n-1}B \end{bmatrix}\) (对于SISO，B是列向量b)。 \(P_c^{-1} = Q_c \begin{bmatrix} \alpha_1 & \alpha_2 & \dots & \alpha_{n-1} & 1 \\ \alpha_2 & \alpha_3 & \dots & 1 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ \alpha_{n-1} & 1 & \dots & 0 & 0 \\ 1 & 0 & \dots & 0 & 0 \end{bmatrix}\) (这是一个常见的变换矩阵形式，具体形式可能因标准型定义而异) 或者，更直接的是**Ackermann公式 (SISO系统)**：如果系统 \((A,b)\) 能控，期望特征多项式为 \(\Delta_d(s) = s^n + \bar{\alpha}_{n-1}s^{n-1} + \dots + \bar{\alpha}_0\)。则反馈增益行向量 \(K = \begin{bmatrix} 0 & 0 & \dots & 0 & 1 \end{bmatrix} Q_c^{-1} \Delta_d(A)\) 其中 \(\Delta_d(A) = A^n + \bar{\alpha}_{n-1}A^{n-1} + \dots + \bar{\alpha}_1 A + \bar{\alpha}_0 I\)。

例题 (参考P19-P20，SISO系统) \(A = \begin{bmatrix} 0 & 1 & 0 \\ 1 & -1 & 1 \\ 0 & 1 & 0 \end{bmatrix}, B = \begin{bmatrix} 1 \\ 1 \\ 0 \end{bmatrix}\) 期望极点：\(-1, -2+j2, -2-j2\)。 \(\Delta_d(s) = s^3+5s^2+12s+8\)。所以 \(\bar{\alpha}_2=5, \bar{\alpha}_1=12, \bar{\alpha}_0=8\)。原系统特征多项式 \(\det(sI-A) = s^3+s^2-2s\)。所以 \(\alpha_2=1, \alpha_1=-2, \alpha_0=0\)。 \(\bar{K} = \begin{bmatrix} \bar{\alpha}_0-\alpha_0 & \bar{\alpha}_1-\alpha_1 & \bar{\alpha}_2-\alpha_2 \end{bmatrix} = \begin{bmatrix} 8-0 & 12-(-2) & 5-1 \end{bmatrix} = \begin{bmatrix} 8 & 14 & 4 \end{bmatrix}\)。 (注意：PPT P18的 \(\overline{k}_i = \overline{\alpha}_i - \alpha_i\) 是指 \(\overline{\alpha}_i\) 是 \(s^{n-i}\) 的系数，\(\alpha_i\) 是原系统 \(s^{n-i}\) 的系数。所以 \(\bar{k}_1 = \bar{\alpha}_1-\alpha_1\) (对应 \(s^{n-1}\))，... \(\bar{k}_n = \bar{\alpha}_n-\alpha_n\) (对应 \(s^0\))。这里若按 \(s^n + \bar{\alpha}_{n-1}s^{n-1} + \dots + \bar{\alpha}_0\) 和 \(s^n + \alpha_{n-1}s^{n-1} + \dots + \alpha_0\) 形式，则 \(\bar{K} = \begin{bmatrix} \bar{\alpha}_0-\alpha_0 & \bar{\alpha}_1-\alpha_1 & \dots & \bar{\alpha}_{n-1}-\alpha_{n-1} \end{bmatrix}\) 对应PPT P20的 \(\bar{k}=\begin{bmatrix}4&14&8\end{bmatrix}\)，对应的是 \(\bar{k}_1=\bar{\alpha}_2-\alpha_2 = 5-1=4\) (对应 \(s^2\) 系数)，\(\bar{k}_2=\bar{\alpha}_1-\alpha_1 = 12-(-2)=14\) (对应 \(s^1\) 系数)，\(\bar{k}_3=\bar{\alpha}_0-\alpha_0 = 8-0=8\) (对应 \(s^0\) 系数)。所以 \(\bar{K} = \begin{bmatrix} 8 & 14 & 4 \end{bmatrix}\) (按 \(s^0, s^1, s^2\) 顺序)。变换矩阵 \(P_c^{-1} = Q_c \begin{bmatrix} \alpha_1 & \alpha_2 & 1 \\ \alpha_2 & 1 & 0 \\ 1 & 0 & 0 \end{bmatrix}\) (SISO, n=3时)。 \(Q_c = \begin{bmatrix} B & AB & A^2B \end{bmatrix} = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 0 & 2 \\ 0 & 1 & 0 \end{bmatrix}\)。 \(P_c^{-1} = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 0 & 2 \\ 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} -2 & 1 & 1 \\ 1 & 1 & 0 \\ 0 & 0 & 0 \end{bmatrix}\) (这里PPT P19的 \(\alpha_i\) 矩阵是 \(n \times n\)的，与这里的 \(3 \times 3\) 不同，需要核对PPT P13的公式) PPT P19 的 \(Q = G_c \begin{bmatrix} 1 & \alpha_1 & \alpha_2 \\ 0 & 1 & \alpha_1 \\ 0 & 0 & 1 \end{bmatrix}\) (这里 \(\alpha_i\) 是 \(s^{n-i}\) 的系数，\(\alpha_1=1, \alpha_2=-2, \alpha_3=0\) for \(s^3+\alpha_1s^2+\alpha_2s+\alpha_3\)) \(Q = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 0 & 2 \\ 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} 1 & 1 & -2 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{bmatrix} = \begin{bmatrix} 1 & 2 & -1 \\ 1 & 1 & 0 \\ 0 & 1 & 1 \end{bmatrix}\)。 \(P_c = Q^{-1} = \frac{1}{2}\begin{bmatrix} -1 & 3 & -1 \\ 1 & -1 & 1 \\ -1 & 1 & 1 \end{bmatrix}\)。 \(K = \bar{K}P_c = \begin{bmatrix} 8 & 14 & 4 \end{bmatrix} \frac{1}{2}\begin{bmatrix} -1 & 3 & -1 \\ 1 & -1 & 1 \\ -1 & 1 & 1 \end{bmatrix} = \begin{bmatrix} 1 & 3 & 9 \end{bmatrix}\)。这与PPT P20结果一致。

状态反馈不改变系统传递函数的零点。

4. 能镇定性 (Stabilizability)

如果系统 \((A,B)\) 不是完全能控的，但其所有不能控的状态模式 (即 \(A_{\bar{c}}\) 的特征值) 都是稳定的 (即具有负实部)，则称该系统是能镇定的。此时，可以通过状态反馈将能控部分的极点任意配置（通常配置到左半平面），而不能控部分自身是稳定的，从而使整个闭环系统稳定。

三、状态观测器设计 (Observer Design)

当系统的状态变量不能直接测量时，需要设计状态观测器来估计系统状态。

1. 状态观测器的结构 (Luenberger Observer)

考虑系统 \(\dot{\textbf{x}} = A\textbf{x} + B\textbf{u}\), \(\textbf{y} = C\textbf{x}\) (假设 \(D=0\))。状态观测器的思想是构造一个与原系统模型相同的动态系统，并利用实际输出 \(\textbf{y}\) 与估计输出 \(\hat{\textbf{y}}\) 之间的差值进行校正。观测器方程： \[ \dot{\hat{\textbf{x}}} = A\hat{\textbf{x}} + B\textbf{u} + L(\textbf{y} - \hat{\textbf{y}}) \] \[ \hat{\textbf{y}} = C\hat{\textbf{x}} \] 其中：

\(\hat{\textbf{x}}\) 是状态 \(\textbf{x}\) 的估计值。
\(L\) 是 \(n \times m\) 维的观测器增益矩阵。

[图片：带观测器的系统框图，如 `chapter6_Design_of_Feedback_Control_System.pdf` P39]

定义状态估计误差 \(\textbf{e}(t) = \textbf{x}(t) - \hat{\textbf{x}}(t)\)。对误差求导： \(\dot{\textbf{e}} = \dot{\textbf{x}} - \dot{\hat{\textbf{x}}}\) \(= (A\textbf{x} + B\textbf{u}) - (A\hat{\textbf{x}} + B\textbf{u} + L(\textbf{y} - C\hat{\textbf{x}}))\) \(= A\textbf{x} - A\hat{\textbf{x}} - L(C\textbf{x} - C\hat{\textbf{x}})\) \(= A(\textbf{x} - \hat{\textbf{x}}) - LC(\textbf{x} - \hat{\textbf{x}})\) \[ \dot{\textbf{e}}(t) = (A-LC)\textbf{e}(t) \] 这是一个齐次线性系统。如果矩阵 \(A-LC\) 的所有特征值都具有负实部，则估计误差 \(\textbf{e}(t) \to \textbf{0}\) (当 \(t \to \infty\))，即 \(\hat{\textbf{x}}(t) \to \textbf{x}(t)\)。

2. 观测器极点配置

定理：线性定常系统 \(\dot{\textbf{x}} = A\textbf{x} + B\textbf{u}\), \(\textbf{y} = C\textbf{x}\) 的观测器误差动态矩阵 \((A-LC)\) 的极点可以任意配置的充分必要条件是：系统 \((A,C)\) 完全能观测。

这与状态反馈的极点配置是对偶的。如果 \((A,C)\) 能观测，则 \((A^T, C^T)\) 能控。我们可以为 \((A^T, C^T)\) 设计一个状态反馈增益 \(K_o\) 使得 \(A^T-C^T K_o\) 具有期望极点。令 \(L = K_o^T\)，则 \(A-LC\) 的特征值与 \((A^T-C^T L^T)^T\) 的特征值相同，即与 \(A^T-C^T K_o\) 的特征值相同。

设计步骤：类似于状态反馈增益 \(K\) 的设计，可以通过直接比较法或基于可观标准型的变换法（或Ackermann公式的对偶形式）来设计观测器增益 \(L\)。 Ackermann公式对偶形式 (SISO系统)：如果系统 \((A,c)\) 能观测 (c为行向量)，期望观测器误差特征多项式为 \(\Delta_e(s) = s^n + \hat{\alpha}_{n-1}s^{n-1} + \dots + \hat{\alpha}_0\)。则观测器增益列向量 \(L = \Delta_e(A) (Q_o^T Q_o)^{-1} Q_o^T \begin{bmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix}\) (这是一个复杂形式，更常用的是先求 \(A^T-C^T L^T\) 的 \(L^T\)) 或者直接：\(L = \Delta_e(A) (Q_o)^{-1} \begin{bmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix}\) (若 \(Q_o\) 可逆，但 \(Q_o\) 是 \(mn \times n\) 矩阵，一般不可逆) 更简洁的对偶形式：设计 \(K_o\) 使得 \(A^T-C^T K_o\) 具有期望极点，然后 \(L=K_o^T\)。

例题 (参考P49-P51) \(\dot{\textbf{x}} = \begin{bmatrix} 1 & 2 & 0 \\ 3 & -1 & 1 \\ 0 & 2 & 0 \end{bmatrix}\textbf{x} + \begin{bmatrix} 2 \\ 1 \\ 1 \end{bmatrix}u, y = \begin{bmatrix} 0 & 0 & 1 \end{bmatrix}\textbf{x}\) 期望观测器极点：\(-3, -3, -10\)。 \(\Delta_e(s) = (s+3)^2(s+10) = s^3+16s^2+69s+90\)。 \(\hat{\alpha}_2=16, \hat{\alpha}_1=69, \hat{\alpha}_0=90\)。系统 \((A,C)\) 能观测。原系统特征多项式 \(\det(sI-A) = s^3 - 9s + 2\)。\(\alpha_2=0, \alpha_1=-9, \alpha_0=2\)。使用基于可观标准型的变换法 (PPT P47-P48)： \(\bar{L} = \begin{bmatrix} \hat{\alpha}_0-\alpha_0 \\ \hat{\alpha}_1-\alpha_1 \\ \hat{\alpha}_2-\alpha_2 \end{bmatrix} = \begin{bmatrix} 90-2 \\ 69-(-9) \\ 16-0 \end{bmatrix} = \begin{bmatrix} 88 \\ 78 \\ 16 \end{bmatrix}\) 变换矩阵 \(P_o\) (使得 \(P_oAP_o^{-1}\) 为可观标准型，\(\bar{C}=CP_o^{-1}=[0 \dots 0 \ 1]\)) \(P_o^T = [(A^T)^2C^T \ A^TC^T \ C^T] \begin{bmatrix} 1 & \alpha_2 & \alpha_1 \\ 0 & 1 & \alpha_2 \\ 0 & 0 & 1 \end{bmatrix}\) (PPT P44的公式，这里的 \(\alpha_i\) 是 \(s^i\) 的系数) \(A^T = \begin{bmatrix} 1 & 3 & 0 \\ 2 & -1 & 2 \\ 0 & 1 & 0 \end{bmatrix}, C^T = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}\) \(A^TC^T = \begin{bmatrix} 0 \\ 2 \\ 0 \end{bmatrix}, (A^T)^2C^T = \begin{bmatrix} 6 \\ -2 \\ 2 \end{bmatrix}\) \(P_o^T = \begin{bmatrix} 6 & 0 & 0 \\ -2 & 2 & 0 \\ 2 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ -9 & 0 & 1 \end{bmatrix}\) (PPT P46的 \(\alpha_1=0, \alpha_2=-9, \alpha_3=2\) 对应 \(s^3+\alpha_1 s^2+\alpha_2 s+\alpha_3=0\)) \(P_o^T = \begin{bmatrix} 6 & 0 & 0 \\ -2 & 2 & 0 \\ -7 & 0 & 1 \end{bmatrix}\) \(P_o = \begin{bmatrix} 6 & -2 & -7 \\ 0 & 2 & 0 \\ 0 & 0 & 1 \end{bmatrix}\) \(P_o^{-1} = \begin{bmatrix} 1/6 & 1/6 & 7/6 \\ 0 & 1/2 & 0 \\ 0 & 0 & 1 \end{bmatrix}\) \(L = P_o^{-1}\bar{L} = \begin{bmatrix} 1/6 & 1/6 & 7/6 \\ 0 & 1/2 & 0 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 88 \\ 78 \\ 16 \end{bmatrix} = \begin{bmatrix} (88+78+112)/6 \\ 39 \\ 16 \end{bmatrix} = \begin{bmatrix} 278/6 \\ 39 \\ 16 \end{bmatrix} = \begin{bmatrix} 139/3 \\ 39 \\ 16 \end{bmatrix}\) 这与PPT P50结果一致。

3. 能检测性 (Detectability)

如果系统 \((A,C)\) 不是完全能观测的，但其所有不能观测的状态模式 (即 \(A_{\bar{o}}\) 的特征值) 都是稳定的 (即具有负实部)，则称该系统是能检测的。此时，虽然不能精确估计所有状态，但未被观测到的状态分量自身会衰减到零。

四、基于观测器的状态反馈 (Observer-Based State Feedback)

当状态 \(\textbf{x}\) 不能直接测量时，使用估计状态 \(\hat{\textbf{x}}\) 来实现反馈： \[ \textbf{u}(t) = \textbf{r}(t) - K\hat{\textbf{x}}(t) \] 将此控制律代入原系统和观测器方程： \(\dot{\textbf{x}} = A\textbf{x} + B(\textbf{r} - K\hat{\textbf{x}}) = A\textbf{x} - BK\hat{\textbf{x}} + B\textbf{r}\) \(\dot{\hat{\textbf{x}}} = A\hat{\textbf{x}} + B(\textbf{r} - K\hat{\textbf{x}}) + L(C\textbf{x} - C\hat{\textbf{x}})\) \(= (A-BK-LC)\hat{\textbf{x}} + LC\textbf{x} + B\textbf{r}\)

考虑状态 \(\textbf{x}\) 和估计误差 \(\textbf{e} = \textbf{x}-\hat{\textbf{x}}\) 构成的增广系统： \(\hat{\textbf{x}} = \textbf{x}-\textbf{e}\) \(\dot{\textbf{x}} = A\textbf{x} - BK(\textbf{x}-\textbf{e}) + B\textbf{r} = (A-BK)\textbf{x} + BK\textbf{e} + B\textbf{r}\) \(\dot{\textbf{e}} = (A-LC)\textbf{e}\) 增广系统的状态方程为： \[ \begin{bmatrix} \dot{\textbf{x}} \\ \dot{\textbf{e}} \end{bmatrix} = \begin{bmatrix} A-BK & BK \\ 0 & A-LC \end{bmatrix} \begin{bmatrix} \textbf{x} \\ \textbf{e} \end{bmatrix} + \begin{bmatrix} B \\ 0 \end{bmatrix} \textbf{r} \] 输出 \(\textbf{y} = C\textbf{x} = \begin{bmatrix} C & 0 \end{bmatrix} \begin{bmatrix} \textbf{x} \\ \textbf{e} \end{bmatrix}\) (若D=0)

分离原理 (Separation Principle)：上述增广系统的特征值为矩阵 \(\begin{bmatrix} A-BK & BK \\ 0 & A-LC \end{bmatrix}\) 的特征值。由于这是一个上三角分块矩阵，其特征值集合是 \(A-BK\) 的特征值集合与 \(A-LC\) 的特征值集合的并集。这意味着：

状态反馈增益 \(K\) 的设计（用于配置 \(A-BK\) 的极点）和观测器增益 \(L\) 的设计（用于配置 \(A-LC\) 的极点）可以独立进行。
如果状态反馈控制器使系统稳定，并且状态观测器是稳定的（即误差收敛），则整个闭环系统是稳定的。
通常，观测器的极点应配置得比控制器的极点“更快”（即实部更负，或模更小），以确保状态估计能迅速跟上实际状态的变化。一般建议观测器极点比控制器极点快2-5倍。

从参考输入 \(\textbf{r}\) 到输出 \(\textbf{y}\) 的传递函数与使用真实状态反馈时相同： \(G_{yr}(s) = C(sI-(A-BK))^{-1}B\) (当D=0时)

五、降维观测器 (Reduced-Order Observer)

如果系统的输出 \(\textbf{y}=C\textbf{x}\) 中，输出矩阵 \(C\) 的秩为 \(m\) (\(m

基本思想：

通过状态变换，将状态向量分为可直接由输出测量的部分 \(\textbf{x}_1\) (\(m\) 维) 和需要估计的部分 \(\textbf{x}_2\) (\(n-m\) 维)。设 \(P = \begin{bmatrix} C \\ R \end{bmatrix}\) (其中 \(R\) 的选择使得 \(P\) 非奇异)，则 \(\bar{\textbf{x}} = P\textbf{x} = \begin{bmatrix} C\textbf{x} \\ R\textbf{x} \end{bmatrix} = \begin{bmatrix} \textbf{y} \\ R\textbf{x} \end{bmatrix}\)。令 \(\bar{\textbf{x}}_1 = \textbf{y}\)，\(\bar{\textbf{x}}_2 = R\textbf{x}\)。
对变换后的系统 \((\bar{A}, \bar{B}, \bar{C})\)，其中 \(\bar{C} = CP^{-1} = \begin{bmatrix} I_m & 0 \end{bmatrix}\)。 \[ \begin{bmatrix} \dot{\bar{\textbf{x}}}_1 \\ \dot{\bar{\textbf{x}}}_2 \end{bmatrix} = \begin{bmatrix} \bar{A}_{11} & \bar{A}_{12} \\ \bar{A}_{21} & \bar{A}_{22} \end{bmatrix} \begin{bmatrix} \bar{\textbf{x}}_1 \\ \bar{\textbf{x}}_2 \end{bmatrix} + \begin{bmatrix} \bar{B}_1 \\ \bar{B}_2 \end{bmatrix} \textbf{u} \] \[ \textbf{y} = \bar{\textbf{x}}_1 \]
从上式中分离出关于 \(\bar{\textbf{x}}_2\) 的动态方程，并将其视为一个 \((n-m)\) 维的子系统，其“输入”包含 \(\textbf{u}\) 和已知的 \(\textbf{y}\) (即 \(\bar{\textbf{x}}_1\))，其“输出”是 \(\dot{\textbf{y}} - \bar{A}_{11}\textbf{y} - \bar{B}_1\textbf{u} = \bar{A}_{12}\bar{\textbf{x}}_2\)。即： \(\dot{\bar{\textbf{x}}}_2 = \bar{A}_{22}\bar{\textbf{x}}_2 + (\bar{A}_{21}\textbf{y} + \bar{B}_2\textbf{u})\) \(\textbf{y}_m = \bar{A}_{12}\bar{\textbf{x}}_2\)，其中 \(\textbf{y}_m = \dot{\textbf{y}} - \bar{A}_{11}\textbf{y} - \bar{B}_1\textbf{u}\)
为这个 \((n-m)\) 维的子系统 \((\bar{A}_{22}, \text{输入项}, \bar{A}_{12})\) 设计一个全维观测器来估计 \(\bar{\textbf{x}}_2\)。这个观测器的维数是 \(n-m\)。观测器方程为： \(\dot{\hat{\bar{\textbf{x}}}}_2 = \bar{A}_{22}\hat{\bar{\textbf{x}}}_2 + (\bar{A}_{21}\textbf{y} + \bar{B}_2\textbf{u}) + \bar{L}(\textbf{y}_m - \bar{A}_{12}\hat{\bar{\textbf{x}}}_2)\) \(= (\bar{A}_{22} - \bar{L}\bar{A}_{12})\hat{\bar{\textbf{x}}}_2 + \bar{A}_{21}\textbf{y} + \bar{B}_2\textbf{u} + \bar{L}(\dot{\textbf{y}} - \bar{A}_{11}\textbf{y} - \bar{B}_1\textbf{u})\)
由于上式中包含 \(\dot{\textbf{y}}\)，为避免使用微分器，引入辅助变量 \(z = \hat{\bar{\textbf{x}}}_2 - \bar{L}\textbf{y}\)。推导后得到降维观测器的最终形式 (参考P70-P71)： \[ \dot{\textbf{z}} = F\textbf{z} + G\textbf{y} + H\textbf{u} \] \[ \hat{\bar{\textbf{x}}}_2 = \textbf{z} + \bar{L}\textbf{y} \] 其中： \(F = \bar{A}_{22} - \bar{L}\bar{A}_{12}\) \(H = \bar{B}_2 - \bar{L}\bar{B}_1\) \(G = \bar{A}_{21} - \bar{L}\bar{A}_{11} + (\bar{A}_{22} - \bar{L}\bar{A}_{12})\bar{L} = \bar{A}_{21} - \bar{L}\bar{A}_{11} + F\bar{L}\)
最终的状态估计 \(\hat{\textbf{x}} = P^{-1} \begin{bmatrix} \textbf{y} \\ \hat{\bar{\textbf{x}}}_2 \end{bmatrix}\)。

降维观测器的极点由 \(F = \bar{A}_{22} - \bar{L}\bar{A}_{12}\) 的特征值决定，可以任意配置，只要子系统 \((\bar{A}_{22}, \bar{A}_{12})\) 能观测。分离原理对于降维观测器和状态反馈组成的系统仍然成立。闭环系统的极点是状态反馈配置的 \(n\) 个极点和降维观测器配置的 \(n-m\) 个极点之和。

例题 (参考P73-P76) \(A = \begin{bmatrix} 1 & 2 & 0 \\ 3 & -1 & 1 \\ 0 & 2 & 0 \end{bmatrix}, B = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}, C = \begin{bmatrix} -1 & 1 & 1 \end{bmatrix}\) 系统能观测，输出维数 \(m=1\)，状态维数 \(n=3\)，所以降维观测器维数为 \(n-m=2\)。步骤与PPT一致，最终得到 \(F, G, H, \bar{L}\) 以及 \(\hat{\textbf{x}}\) 的表达式。

六、总结

状态反馈是改变系统动态特性的有效手段，其核心是极点配置。
系统能控是任意配置闭环极点的前提。
状态观测器用于估计不可直接测量的状态变量。
系统能观测是任意配置观测器误差动态极点的前提。
分离原理保证了控制器和观测器可以独立设计，且组合后系统的极点为两者极点之和。
降维观测器可以减少观测器的阶数，当部分状态可由输出直接得到时使用。

本章内容是现代控制系统设计的核心，将理论应用于实践。