C++ 快速查找某个值是否存在于 C 数组中

发布于09月04日

我有一个具有时间关键型ISR的嵌入式应用程序，它需要迭代大小为256的数组(最好是1024，但256是最小值)，并判断值是否与数组内容匹配.如果是这样，bool将设置为true.

The microcontroller is an NXP LPC4357, ARM Cortex M4 core, and the compiler is GCC. I already have combined optimisation level 2 (3 is slower) and placing the function in RAM instead of flash. I also use pointer arithmetic and a for loop, which does down-counting instead of up (checking if i!=0 is faster than checking if i<256). All in all, I end up with a duration of 12.5 µs which has to be reduced drastically to be feasible. This is the (pseudo) code I use now:

uint32_t i;
uint32_t *array_ptr = &theArray[0];
uint32_t compareVal = 0x1234ABCD;
bool validFlag = false;

for (i=256; i!=0; i--)
{
    if (compareVal == *array_ptr++)
    {
         validFlag = true;
         break;
     }
}

最快的方法是什么？允许使用内联汇编.其他"不那么优雅"的把戏也被允许.

; r0 = count, r1 = source ptr, r2 = comparison value stmfd sp!,{r4-r11} ; save non-volatile registers mov r3,r0,LSR #3 ; loop count = total count / 8 pld [r1,#128] ldmia r1!,{r4-r7} ; pre load first set loop_top: pld [r1,#128] ldmia r1!,{r8-r11} ; pre load second set cmp r4,r2 ; search for match cmpne r5,r2 ; use conditional execution to avoid extra branch instructions cmpne r6,r2 cmpne r7,r2 beq found_it ldmia r1!,{r4-r7} ; use 2 sets of registers to hide load delays cmp r8,r2 cmpne r9,r2 cmpne r10,r2 cmpne r11,r2 beq found_it subs r3,r3,#1 ; decrement loop count bne loop_top mov r0,#0 ; return value = false (not found) ldmia sp!,{r4-r11} ; restore non-volatile registers bx lr ; return found_it: mov r0,#1 ; return true ldmia sp!,{r4-r11} bx lr

C++ 快速查找某个值是否存在于 C 数组中

推荐答案

C++相关问答推荐

librsvg rsvg_handle_get_dimensions获取像素大小与浏览器中的渲染大小没有不同

为什么已经设置的值在C中被重置为for循环条件中的新值？

C中的attributor((aligned(4)，packed))与 struct 的用法

正在try 将文件/文件夹名从目录 struct 存储到链接列表

GCC创建应用于移动项的单独位掩码的目的是什么？

进程在写入管道时挂起

为什么此共享库没有预期的依赖项？

FRIDA-服务器成为端口扫描的目标？

一旦运行长度超过2，编译器是否会优化"；strnlen(mystring，32)>；2"；以停止循环？

ifdef __cplusplus中的整数文字单引号

#定义SSL_CONNECTION_NO_CONST

在C语言中，指针指向一个数组

C堆栈(使用动态数组)realloc内存泄漏问题

这段代码用于在C中以相反的顺序打印数组，但它不起作用

Go和C中的数据 struct 对齐差异

宏观；S C调深度

有没有办法减少C语言中线程的堆大小？

为什么孤儿进程在 Linux 中没有被 PID 1 采用，就像我读过的一本书中声称的那样？

如何修复数组数据与列标题未对齐的问题？

使用邻接表创建图