Question

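I am trying to formalize some properties of regular expressions (REs) in Coq. However, I am having trouble proving a fairly simple property:

For every string s, if s is in the language of the RE (epsilon)*, then s = "", where epsilon and * denote the empty-string RE and the Kleene star operation.

This looks like an obvious application of the induction/inversion tactics, but I could not make it work.

The minimal working code with the problematic lemma is in the following gist. Any hints on how I should proceed would be appreciated.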

Edit:

One of my attempts was this:

Lemma star_lemma : forall s, s <<- (#1 ^*) -> s = "".
Proof.  
  intros s H.
  inverts* H.
  inverts* H2.
  inverts* H1.
  inverts* H1.
  inverts* H2.
  simpl in *.
  (* stuck here *)

which leaves me with the following goal:

s' : string
H4 : s' <<- (#1 ^*)
============================
s' = ""

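At least to me, it seems that induction should complete the proof, since I could apply the induction hypothesis to H4 to finish, but when I start the proof with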

induction H

instead of

inverts* H

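I get goals that (at least to me) make no sense. In Idris/Agda, such a proof simply follows by pattern matching and recursion on the structure of s <<- (#1 ^*). My question is how to do this kind of recursion in Coq.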

Answer 1

I modified the definition of your in_regex predicate:

Inductive in_regex : string -> regex -> Prop :=
| InEps
  : "" <<- #1
| InChr
  : forall c
  , (String c EmptyString) <<- ($ c)
| InCat
  :  forall e e' s s' s1
  ,  s <<- e
  -> s' <<- e'
  -> s1 = s ++ s'
  -> s1 <<- (e @ e')
| InLeft
  :  forall s e e'
  ,  s <<- e
  -> s <<- (e :+: e')
| InRight
  :  forall s' e e'
  ,  s' <<- e'
  -> s' <<- (e :+: e')
| InStarLeft
  : forall e
  , "" <<- (e ^*)
| InStarRight
  :  forall s s' e
  ,  s <<- e
  -> s' <<- (e ^*)
  -> (s ++ s') <<- (e ^*)
where "s '<<-' e" := (in_regex s e).

With it, your lemma can be proved:

Lemma star_lemma : forall s, s <<- (#1 ^*) -> s = "".
Proof.
  intros s H.
  remember (#1 ^*) as r.
  induction H; inversion Heqr; clear Heqr; trivial.
  subst e.
  rewrite IHin_regex2; trivial.
  inversion H; trivial.
Qed.

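Some explanations are in order.

I did induction on H. The reasoning is: if I have a proof of s <<- (#1 ^*), then this proof must have the following form...
The remember tactic creates a new hypothesis Heqr which, combined with inversion, helps get rid of the cases that cannot produce such a proof (in fact, all cases except those with ^* in the conclusion).
Unfortunately, this line of reasoning does not work with your definition of the in_regex predicate, because it creates an unsatisfiable condition in the induction hypothesis. That is why I modified your inductive predicate as well.
The modified inductive definition tries to give a more basic characterization of membership in (e ^*). Semantically, I think it is equivalent.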
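The remember-then-induction pattern described above can be tried in isolation on a much smaller predicate. The sketch below is only an illustration and is not taken from the question: the predicate is_even and the lemma one_not_even are hypothetical names introduced here. Without the remember step, induction on a hypothesis whose index is the concrete term 1 would typically forget that the index was 1.

```coq
(* Hypothetical toy predicate, used only to illustrate the pattern. *)
Inductive is_even : nat -> Prop :=
| is_even_O : is_even 0
| is_even_SS : forall n, is_even n -> is_even (S (S n)).

Lemma one_not_even : ~ is_even 1.
Proof.
  intros H.
  (* Replace the concrete index 1 by a variable n, keeping Heqn : n = 1. *)
  remember 1 as n eqn:Heqn.
  (* Induction now carries the equation into every case. *)
  induction H.
  - (* Heqn : 0 = 1 *)
    discriminate Heqn.
  - (* Heqn : S (S n) = 1 *)
    discriminate Heqn.
Qed.
```

Here every case is refuted by the remembered equation. In the star lemma, the same equation is what rules out the constructors whose conclusion is not a starred expression.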

I would be interested in reading a proof for the original problem.

Answer 2

Here is one possible solution to the original problem:

Lemma star_lemma : forall s,
    s <<- (#1 ^*) -> s = "".
Proof.
  refine (fix star_lemma s prf {struct prf} : s = "" := _).
  inversion_clear prf; subst.
  inversion_clear H; subst.
  - now inversion H0.
  - inversion_clear H0; subst. inversion_clear H; subst.
    rewrite (star_lemma s' H1).
    reflexivity.
Qed.

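The main idea is to introduce into the context a term that plays the role of the recursive call in a typical Idris proof. The remember and dependent induction approaches do not work well (without modifying in_regex) because they introduce impossible-to-satisfy equations as premises of the induction hypotheses.

Note: checking this lemma may take a while (about 40 seconds on my machine under Coq 8.5pl3). I think this is because the inversion tactic tends to generate large proof terms.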
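The idea of recursing directly on a proof, as one would in Idris or Agda, can be seen on a smaller example with an explicit Fixpoint whose structural argument is the derivation itself. This is only a sketch under assumptions: the predicate ev and the function ev_sum are hypothetical names introduced for illustration and do not appear in the question.

```coq
(* Hypothetical toy predicate, used only to illustrate recursion on a proof. *)
Inductive ev : nat -> Prop :=
| ev_O : ev 0
| ev_SS : forall n, ev n -> ev (S (S n)).

(* Structural recursion on the derivation pn, not on the number n. *)
Fixpoint ev_sum (n m : nat) (pn : ev n) (pm : ev m) {struct pn}
  : ev (n + m) :=
  match pn in ev k return ev (k + m) with
  | ev_O => pm
  | ev_SS n' pn' =>
      (* pn' is a strict sub-derivation of pn, so this call is accepted. *)
      ev_SS (n' + m) (ev_sum n' m pn' pm)
  end.
```

The refine (fix ...) step in the proof above sets up the same kind of recursive function interactively, and the termination checker verifies at Qed that every recursive call is made on a sub-derivation.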

Answer 3

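This problem bothered me for a week, and I finally found a solution that I find elegant.

I had read that when an induction principle does not fit your needs, you can write and prove another one that is better suited to your problem. That is what I did in this case. What we want is the principle we would obtain when using the more natural definition given in this answer. That way, we can keep the same definition (for example, if changing it would require too many other changes) and still reason about it more easily.

Here is the proof of the induction principle (I use a section to specify the implicit arguments precisely, since otherwise I observe strange behaviour with them, but the section mechanism is not at all necessary here).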

Section induction_principle.

Context (P : string -> regex -> Prop)
  (H_InEps : P "" #1)
  (H_InChr : forall c, P (String c "") ($ c))
  (H_InCat : forall {e e' s s' s1}, s <<- e -> P s e -> s' <<- e' ->
    P s' e' -> s1 = s ++ s' -> P s1 (e @ e'))
  (H_InLeft : forall {s e e'}, s <<- e -> P s e -> P s (e :+: e'))
  (H_InRight : forall {s' e e'}, s' <<- e' -> P s' e' -> P s' (e :+: e'))
  (H_InStar_Eps : forall e, P "" (e ^*))
  (H_InStar_Cat : forall {s1 s2 e}, s1 <<- e -> s2 <<- (e ^*) ->
    P s1 e -> P s2 (e ^*) -> P (s1++s2) (e ^*)).

Arguments H_InCat {_ _ _ _ _} _ _ _ _ _.
Arguments H_InLeft {_ _ _} _ _.
Arguments H_InRight {_ _ _} _ _.
Arguments H_InStar_Cat {_ _ _} _ _ _ _.

Definition in_regex_ind2 : forall (s : string) (r : regex), s <<- r -> P s r.
Proof.
  refine (fix in_regex_ind2 {s r} prf {struct prf} : P s r :=
    match prf with
    | InEps => H_InEps
    | InChr c => H_InChr c
    | InCat prf1 prf2 eq1 =>
        H_InCat prf1 (in_regex_ind2 prf1) prf2 (in_regex_ind2 prf2) eq1
    | InLeft _ prf => H_InLeft prf (in_regex_ind2 prf)
    | InRight _ prf => H_InRight prf (in_regex_ind2 prf)
    | InStar prf => _
    end).
  inversion prf; subst.
  - inversion H1. apply H_InStar_Eps.
  - inversion H1; subst.
    apply H_InStar_Cat; try assumption; apply in_regex_ind2; assumption.
Qed.

End induction_principle.

It turned out that the Qed of this proof was not instantaneous (probably because inversion produces large terms, as in this answer), but it took less than 1 second (maybe because the lemma is more abstract).

star_lemma then becomes nearly trivial to prove (once we know the remember trick), just as with the natural definition.
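The general technique of stating a custom induction principle and then invoking it with induction ... using can be tried on a smaller, self-contained example. The sketch below is an illustration only and is not part of the answer: nat_ind2 is a two-step induction principle on natural numbers, and zero_one_or_more is a hypothetical lemma whose sole purpose is to show the invocation.

```coq
(* Two-step induction on nat, defined directly as a recursive proof term. *)
Definition nat_ind2 :
  forall (P : nat -> Prop),
    P 0 ->
    P 1 ->
    (forall n : nat, P n -> P (S (S n))) ->
    forall n : nat, P n :=
  fun P P0 P1 PSS =>
    fix f (n : nat) : P n :=
      match n with
      | 0 => P0
      | 1 => P1
      | S (S n') => PSS n' (f n')
      end.

(* Hypothetical lemma, present only to show "induction ... using". *)
Lemma zero_one_or_more :
  forall n : nat, n = 0 \/ n = 1 \/ exists m, n = S (S m).
Proof.
  intros n.
  induction n using nat_ind2.
  - left. reflexivity.
  - right. left. reflexivity.
  - right. right. eexists. reflexivity.
Qed.
```

The custom principle is an ordinary term, so the underlying definition stays untouched. Only the proofs that benefit from the alternative case structure need to mention it.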

Stuck on a simple proof about regular expressions
