C++ 编译器中的 8 位布尔值.对它们的操作效率低吗

发布于11月12日

我正在读Agner Fog的"Optimizing software in C++"(针对英特尔、AMD和VIA的x86处理器)，它在第34页上写道

布尔变量存储为8位整数，值0表示FALSE，值1表示TRUE. 布尔变量在某种意义上是超定的，即所有具有布尔值的运算符变量作为输入判断输入是否具有除0或1以外的任何其他值，但运算符将布尔值作为输出只能生成0或1以外的值.这使得操作以布尔变量作为输入，效率比必要的要低.

这在今天仍然是正确的吗？在哪些编译器上呢？你能举个例子吗？作者说，

如果它可以提高布尔运算的效率，则可以使布尔运算变得更有效率可以肯定地知道，操作数除了0和1之外没有其他值. 编译器为什么不做这样的假设是因为变量可能有其他值(如果它们未初始化或来自未知来源).

这是否意味着，如果我以函数指针bool(*)()为例调用它，那么对它的操作会产生低效的代码？或者，当我通过解引用指针或读取引用来访问布尔值，然后对其进行操作时，会是这种情况吗？

bool logical_or(bool a, bool b) { return a||b; } # gcc4.6.4 -O3 for the x86-64 System V ABI test dil, dil # test a against itself (for non-zero) mov eax, 1 cmove eax, esi # return a ? 1 : b; ret

logical_or PROC ; x86-64 MSVC CL19 test cl, cl ; Windows ABI passes args in ecx, edx jne SHORT $LN3@logical_or test dl, dl jne SHORT $LN3@logical_or xor al, al ; missed peephole: xor eax,eax is strictly better ret 0 $LN3@logical_or: mov al, 1 ret 0 logical_or ENDP

logical_or(bool, bool): # ICC18 xor eax, eax #4.42 movzx edi, dil #4.33 movzx esi, sil #4.33 or edi, esi #4.42 setne al #4.42 ret #4.42

# hand-written implementation that no compilers come close to making select: mov eax, edx # retval = x test edi, esi # ZF = ((a & b) == 0) cmovz eax, ecx # conditional move: return y if ZF is set ret

select: # clang 6.0 trunk 317877 nightly build on Godbolt test esi, esi cmove edx, ecx # x = b ? y : x test edi, edi cmove edx, ecx # x = a ? y : x mov eax, edx # return x ret

select(bool, bool, int, int): # gcc 8.0.0-pre 20171110 test dil, dil mov eax, edx ; compiling with -mtune=intel or -mtune=haswell would keep test/jcc together for macro-fusion. je .L8 test sil, sil je .L8 rep ret .L8: mov eax, ecx ret

select PROC test cl, cl ; a je SHORT $LN3@select mov eax, r8d ; retval = x test dl, dl ; b jne SHORT $LN4@select $LN3@select: mov eax, r9d ; retval = y $LN4@select: ret 0 ; 0 means rsp += 0 after popping the return address, not C return 0. ; MSVC doesn't emit the `ret imm16` opcode here, so IDK why they put an explicit 0 as an operand. select ENDP

select(bool, bool, int, int): test dil, dil #8.13 je ..B4.4 # Prob 50% #8.13 test sil, sil #8.16 jne ..B4.5 # Prob 50% #8.16 ..B4.4: # Preds ..B4.2 ..B4.1 mov edx, ecx #8.13 ..B4.5: # Preds ..B4.2 ..B4.4 mov eax, edx #8.13 ret #8.13

;; MSVC CL19 -Ox = full optimization select2 PROC test cl, cl je SHORT $LN3@select2 test dl, dl je SHORT $LN3@select2 mov al, 1 ; ab = 1 test al, al ;; and then test/cmov on an immediate constant!!! cmovne r9d, r8d mov eax, r9d ret 0 $LN3@select2: xor al, al ;; ab = 0 test al, al ;; and then test/cmov on another path with known-constant condition. cmovne r9d, r8d mov eax, r9d ret 0 select2 ENDP

Combine bool with bitwise operators helps MSVC and ICC

在我非常有限的测试中，对于MSVC和ICC，|和&似乎比||和&&工作得更好.使用编译器+编译选项查看您自己的代码的编译器输出，看看会发生什么.

int select_bitand(bool a, bool b, int x, int y) { return (a&b) ? x : y; }

Gcc still branches separately在两个输入的单独test个上，代码与select的其他版本相同.clang still does two separate 102，与其他源版本的ASM相同.

MSVC通过并正确优化，击败了所有其他编译器(至少在独立定义中):

select_bitand PROC ;; MSVC test cl, dl ;; ZF = !(a & b) cmovne r9d, r8d mov eax, r9d ;; could have done the mov to eax in parallel with the test, off the critical path, but close enough. ret 0

ICC18浪费了两条movzx指令，将bools扩展为int，然后生成与MSVC相同的代码

select_bitand: ## ICC18 movzx edi, dil #16.49 movzx esi, sil #16.49 test edi, esi #17.15 cmovne ecx, edx #17.15 mov eax, ecx #17.15 ret #17.15

C++ 编译器中的 8 位布尔值.对它们的操作效率低吗

推荐答案

当前GCC/clang中错过的优化:

Combine `bool` with bitwise operators helps MSVC and ICC

C++相关问答推荐

如何在不修改字符串缓冲区早期使用的情况下覆盖字符串缓冲区

为什么已经设置的值在C中被重置为for循环条件中的新值？

你能用自己的地址声明一个C指针吗？

如何在IF语句中正确使用0.0

如何创建一个C程序来存储5种动物的名字，并在用户 Select 其中任何一种动物时打印内存地址？

在C语言中，在数学运算过程中，为什么浮点数在变量中的行为不同

解决S随机内存分配问题，实现跨进程高效数据共享

Kdb：仅升级指定的列

<；unistd.h>；和<；sys/unistd.h>；之间有什么区别？

如何在VS 2022中正确安装额外的C头文件

For循环不会迭代所有字符串字符吗？(初学者问题)

如何在VSCode中创建和使用我自己的C库？

C：如何将此代码转换为与数组一起使用？

从不兼容的指针类型返回&&警告，但我看不出原因

GetText不适用于包含国际字符的帐户名称

为什么会导致分段故障？(C语言中的一个程序，统计文件中某个单词的出现次数)

浮点正零何时不全为零？

WSASocket在哪里定义？

Struct 内的数组赋值

是什么阻止编译器优化手写的 memcmp()？

推荐答案

当前GCC/clang中错过的优化:

Combine bool with bitwise operators helps MSVC and ICC

C++相关问答推荐

Combine `bool` with bitwise operators helps MSVC and ICC