x86 branchless implementation of unsigned int32 -> float conversion

Based on a suggestion by Rémi Hutin. The new implementation produces smaller code (3 FP instructions and 2 integer "and") and has no conditional branch. On average, it is a little slower than the old implementation. Hence both implementations are supported: - with -ffavor-branchless, use the new implementation - by default, use the old, branching implementation
author: Xavier Leroy <xavier.leroy@college-de-france.fr> 2019-05-17 16:17:51 +0200
committer: Xavier Leroy <xavier.leroy@college-de-france.fr> 2019-06-07 09:08:40 +0200
commit: b3bfdc3655479f7f8f1c5e3a7571473a72b421cb (patch)
tree: 8fdc27ef9b7c186ee3af9c04327a28b7e8424fd1 /x86/SelectOpproof.v
parent: 8e3a73448c5ddfa4be3871d7f4fd80281a7549f4 (diff)
download: compcert-b3bfdc3655479f7f8f1c5e3a7571473a72b421cb.tar.gz
compcert-b3bfdc3655479f7f8f1c5e3a7571473a72b421cb.zip
1 files changed, 15 insertions, 3 deletions
diff --git a/x86/SelectOpproof.v b/x86/SelectOpproof.v
index b412ccf7..908d6e4a 100644
--- a/x86/SelectOpproof.v
+++ b/x86/SelectOpproof.v
@@ -870,10 +870,22 @@ Theorem eval_floatofintu:
   exists v, eval_expr ge sp e m le (floatofintu a) v /\ Val.lessdef y v.
 Proof.
   intros until y; unfold floatofintu. case (floatofintu_match a); intros.
-  InvEval. TrivialExists.
-  destruct x; simpl in H0; try discriminate. inv H0.
+- InvEval. TrivialExists.
+- destruct x; simpl in H0; try discriminate. inv H0.
   exists (Vfloat (Float.of_intu i)); split; auto.
-  econstructor. eauto.
+  destruct (favor_branchless tt).
++ econstructor. eauto.
+  assert (eval_expr ge sp e m (Vint i :: le) (Eletvar O) (Vint i)) by (constructor; auto).
+  exploit (eval_andimm Float.ox7FFF_FFFF (Vint i :: le) (Eletvar 0)); eauto.
+  simpl. intros [v1 [A1 B1]]. inv B1.
+  exploit (eval_andimm Float.ox8000_0000 (Vint i :: le) (Eletvar 0)); eauto.
+  simpl. intros [v2 [A2 B2]]. inv B2.
+  unfold subf. econstructor.
+  constructor. econstructor. constructor. eexact A1. constructor. simpl; eauto.
+  constructor. econstructor. constructor. eexact A2. constructor. simpl; eauto.
+  constructor.
+  simpl. rewrite Float.of_intu_of_int_3. auto.
++ econstructor. eauto.
   set (fm := Float.of_intu Float.ox8000_0000).
   assert (eval_expr ge sp e m (Vint i :: le) (Eletvar O) (Vint i)).
     constructor. auto.
author	Xavier Leroy <xavier.leroy@college-de-france.fr>	2019-05-17 16:17:51 +0200
committer	Xavier Leroy <xavier.leroy@college-de-france.fr>	2019-06-07 09:08:40 +0200
commit	b3bfdc3655479f7f8f1c5e3a7571473a72b421cb (patch)
tree	8fdc27ef9b7c186ee3af9c04327a28b7e8424fd1 /x86/SelectOpproof.v
parent	8e3a73448c5ddfa4be3871d7f4fd80281a7549f4 (diff)
download	compcert-b3bfdc3655479f7f8f1c5e3a7571473a72b421cb.tar.gz compcert-b3bfdc3655479f7f8f1c5e3a7571473a72b421cb.zip