内联汇编语言是否比本机 C++ 代码慢

发布于03月07日

I tried to compare the performance of inline assembly language and C++ code, so I wrote a function that add two arrays of size 2000 for 100000 times. Here's the code:

#define TIMES 100000
void calcuC(int *x,int *y,int length)
{
    for(int i = 0; i < TIMES; i++)
    {
        for(int j = 0; j < length; j++)
            x[j] += y[j];
    }
}


void calcuAsm(int *x,int *y,int lengthOfArray)
{
    __asm
    {
        mov edi,TIMES
        start:
        mov esi,0
        mov ecx,lengthOfArray
        label:
        mov edx,x
        push edx
        mov eax,DWORD PTR [edx + esi*4]
        mov edx,y
        mov ebx,DWORD PTR [edx + esi*4]
        add eax,ebx
        pop edx
        mov [edx + esi*4],eax
        inc esi
        loop label
        dec edi
        cmp edi,0
        jnz start
    };
}

Here's main():

int main() {
    bool errorOccured = false;
    setbuf(stdout,NULL);
    int *xC,*xAsm,*yC,*yAsm;
    xC = new int[2000];
    xAsm = new int[2000];
    yC = new int[2000];
    yAsm = new int[2000];
    for(int i = 0; i < 2000; i++)
    {
        xC[i] = 0;
        xAsm[i] = 0;
        yC[i] = i;
        yAsm[i] = i;
    }
    time_t start = clock();
    calcuC(xC,yC,2000);

    //    calcuAsm(xAsm,yAsm,2000);
    //    for(int i = 0; i < 2000; i++)
    //    {
    //        if(xC[i] != xAsm[i])
    //        {
    //            cout<<"xC["<<i<<"]="<<xC[i]<<" "<<"xAsm["<<i<<"]="<<xAsm[i]<<endl;
    //            errorOccured = true;
    //            break;
    //        }
    //    }
    //    if(errorOccured)
    //        cout<<"Error occurs!"<<endl;
    //    else
    //        cout<<"Works fine!"<<endl;

    time_t end = clock();

    //    cout<<"time = "<<(float)(end - start) / CLOCKS_PER_SEC<<"\n";

    cout<<"time = "<<end - start<<endl;
    return 0;
}

Then I run the program five times to get the cycles of processor, which could be seen as time. Each time I call one of the function mentioned above only.

And here comes the result.

Function of assembly version:

Debug   Release
---------------
732        668
733        680
659        672
667        675
684        694
Average:   677

Function of C++ version:

Debug     Release
-----------------
1068      168
 999      166
1072      231
1002      166
1114      183
Average:  182

The C++ code in release mode is almost 3.7 times faster than the assembly code. Why?

I guess that the assembly code I wrote is not as effective as those generated by GCC. It's hard for a common programmer like me to wrote code faster than its opponent generated by a compiler.Does that mean I should not trust the performance of assembly language written by my hands, focus on C++ and forget about assembly language?

内联汇编语言是否比本机 C++ 代码慢

Function of assembly version:

Function of C++ version:

推荐答案

C++相关问答推荐

我可以动态分配具有空类型函数的矩阵吗？

C编译器是否遵循restrict的正式定义？

如果我释放其他内容，返回值就会出错

我无法让LLDB正确运行我的可执行文件

如何使用C for Linux和Windows的标准输入与gdb/mi进行通信？

使用错误的命令执行程序

在C++中访问双指针

CC2538裸机项目编译但不起作用

指向不同类型的指针是否与公共初始序列规则匹配？

覆盖读取函数，但当文件描述符为3或4时，我有问题

使用C++中的字符串初始化 struct 时，从‘char*’初始化‘char’使指针变为整数，而不进行强制转换

当用C打印过多的'；\n'；时输出不正确

共享目标代码似乎不能在Linux上的进程之间共享

Linux Posix消息队列

如何解释数组中的(ptr)和(ptr+2)？

分配给静态变量和动态变量的位置之间有区别吗？

将char*铸造为空**

GDB 用内容初始化数组

多行表达式：C 编译器如何处理换行符？

文件指针引起的C程序分段错误