Fastest way to get a positive modulo in C/C++

Posted on September 26

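Very often in my inner loops I need to index an array in a "wrap-around" way, so that (for example) if the array size is 100 and my code asks for element -2, it should be given element 98. In many high-level languages such as Python, one can do this simply with my_array[index % array_size], but for some reason C's integer arithmetic (usually) rounds toward zero instead of consistently rounding down, and consequently its modulo operator returns a negative result when given a negative first argument.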
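To make the rounding behaviour concrete, here is a minimal sketch, assuming a C99-or-later compiler (where division is specified to truncate toward zero). The helper names c_quotient and c_remainder are made up for illustration.

```c
#include <assert.h>

/* Hypothetical helpers that expose C's built-in / and % for inspection.
 * Since C99, / truncates toward zero and (a / b) * b + a % b == a,
 * so the remainder takes the sign of the dividend a. */
int c_quotient(int a, int b) {
    assert(b != 0);
    return a / b;
}

int c_remainder(int a, int b) {
    assert(b != 0);
    return a % b;
}
```

With these, c_quotient(-2, 100) is 0 and c_remainder(-2, 100) is -2, whereas floor-style division (as in Python) would give -1 and 98.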

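Often I know that index will not be less than -array_size, and in these cases I just do my_array[(index + array_size) % array_size]. However, sometimes this can't be guaranteed, and for those cases I would like to know the fastest way to implement an always-positive modulo function. There are several "clever" ways to do it without branching, such as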

inline int positive_modulo(int i, int n) {
    return (n + (i % n)) % n;
}

or

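inline int positive_modulo(int i, int n) {
    int r = i % n;
    /* add n only when the remainder itself is negative, so that negative multiples of n map to 0 rather than n */
    return r + (n * (r < 0));
}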
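Two more variants in the same spirit, as a sketch only: one uses a conditional expression (which compilers often turn into a conditional move rather than a branch, though that is not guaranteed), the other replaces the multiply with a bit mask. Both assume n > 0 and a two's-complement int; the names positive_modulo_cond and positive_modulo_mask are hypothetical.

```c
#include <assert.h>

/* Conditional fix-up: r lies in (-n, n), so one add is enough. */
static inline int positive_modulo_cond(int i, int n) {
    assert(n > 0);
    int r = i % n;
    return r < 0 ? r + n : r;
}

/* Mask fix-up: -(r < 0) is 0 or -1 (all bits set on two's complement),
 * so n & -(r < 0) is either 0 or n. */
static inline int positive_modulo_mask(int i, int n) {
    assert(n > 0);
    int r = i % n;
    return r + (n & -(r < 0));
}
```

Each uses a single % and differs only in how the negative remainder is corrected, so any speed difference would have to be measured on the target machine.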

Of course I can profile these to find out which is the fastest on my system, but I can't help worrying that I might have missed a better one, or that what's fast on my machine might be slow on a different one.

So is there a standard way to do this, or some clever trick that I've missed that's likely to be the fastest possible way?

Also, I know it's probably wishful thinking, but if there's a way of doing this that can be auto-vectorised, that would be amazing.
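For the auto-vectorisation case, the kind of loop I have in mind looks roughly like the sketch below. The function name wrap_indices and its parameters are hypothetical, and n > 0 is assumed. Whether a compiler actually vectorises it depends on the compiler, flags, and target; as far as I know, the % with a run-time n is the likely obstacle, while the sign fix-up is a simple select.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: map every idx[k] into the range [0, n).
 * The loop body has no data-dependent branch and no cross-iteration
 * dependency, which is what auto-vectorisers generally look for. */
void wrap_indices(const int *idx, int *out, size_t count, int n) {
    assert(n > 0);
    for (size_t k = 0; k < count; ++k) {
        int r = idx[k] % n;           /* remainder in (-n, n) */
        out[k] = r < 0 ? r + n : r;   /* shift negatives into [0, n) */
    }
}
```

If n is a compile-time constant, and especially a power of two, the compiler has more room to replace the division with cheaper operations.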

modulo256(int):                         # @modulo256(int)
    mov     edx, edi
    sar     edx, 31
    shr     edx, 24
    lea     eax, [rdi+rdx]
    movzx   eax, al
    sub     eax, edx
    lea     edx, [rax+256]
    test    eax, eax
    cmovs   eax, edx
    ret
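The source for this listing is not shown, so the following is only a guess at a function consistent with it: a signed remainder by 256, followed by a conditional add of 256 when the result is negative (the cmovs). The second function, modulo256_mask, is a hypothetical alternative for power-of-two sizes that avoids the fix-up entirely by masking the unsigned value.

```c
#include <assert.h>

/* Assumed shape of the original: signed remainder, then fix-up. */
int modulo256(int i) {
    int r = i % 256;
    return r < 0 ? r + 256 : r;
}

/* Power-of-two alternative: converting to unsigned wraps modulo 2^N,
 * and 256 divides 2^N, so the low 8 bits are the positive modulo. */
int modulo256_mask(int i) {
    return (int)((unsigned)i & 255u);
}

/* Usage sketch: the two versions should agree on every input tried here. */
void check_modulo256(void) {
    for (int i = -1024; i <= 1024; ++i) {
        assert(modulo256(i) == modulo256_mask(i));
    }
}
```

The mask form only applies when the array size is a power of two, but in that case it reduces the whole operation to a single AND.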

