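Python: Can this recursive function be turned into an iterative function with similar performance?

Posted on August 23

I am using Numba to write a Python function that labels objects in a 2D or 3D array, meaning that all orthogonally connected cells with the same value in the input array are given a unique label from 1 to N in the output array, where N is the number of orthogonally connected groups. It is very similar to functions such as scipy.ndimage.label and similar functions in libraries like scikit-image, but those functions label all orthogonally connected groups of non-zero cells, so they merge connected groups that have different values, which I do not want. For example, given the following input: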

[0 0 7 7 0 0
 0 0 7 0 0 0
 0 0 0 0 0 7
 0 6 6 0 0 7
 0 0 4 4 0 0]

The SciPy function would return

[0 0 1 1 0 0
 0 0 1 0 0 0
 0 0 0 0 0 3
 0 2 2 0 0 3
 0 0 2 2 0 0]

Note that the 6s and 4s are merged into label 2. I want them labeled as separate groups, for example:

[0 0 1 1 0 0
 0 0 1 0 0 0
 0 0 0 0 0 4
 0 2 2 0 0 4
 0 0 3 3 0 0]
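
To make the desired behavior concrete, here is a minimal pure-Python sketch (not the code from the question) that labels orthogonally connected cells of equal value in a 2D grid. It assumes that 0 is background, as in the example above, and it numbers groups in row-major scan order, so the label numbers can differ from the hand-written output while the grouping is the same. The name `label_by_value` is hypothetical, and nested lists are used only so the sketch runs without NumPy or Numba.

```python
from collections import deque


def label_by_value(grid):
    """Label orthogonally connected cells that share the same non-zero value.

    Assumption: 0 is background and keeps label 0.
    Returns (labels, number_of_groups).
    """
    rows, cols = len(grid), len(grid[0])
    labels = [[0] * cols for _ in range(rows)]
    current = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] == 0 or labels[r][c] != 0:
                continue  # background, or already part of a labeled group
            current += 1
            value = grid[r][c]
            labels[r][c] = current
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and labels[ny][nx] == 0 and grid[ny][nx] == value):
                        labels[ny][nx] = current
                        queue.append((ny, nx))
    return labels, current


grid = [
    [0, 0, 7, 7, 0, 0],
    [0, 0, 7, 0, 0, 0],
    [0, 0, 0, 0, 0, 7],
    [0, 6, 6, 0, 0, 7],
    [0, 0, 4, 4, 0, 0],
]
labels, count = label_by_value(grid)
for row in labels:
    print(row)
print("groups:", count)
```

The touching 6s and 4s end up in different groups because a neighbor is only added when its value equals the value of the cell that started the group.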

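I asked this about a year ago and have been using the solution from the accepted answer ever since. However, I am now working on optimizing the runtime of my code and am revisiting this problem.

For the data sizes I typically work with, the linked solution takes about 1m30s to run. I wrote the recursive algorithm below, which takes about 30 seconds as plain Python and 1-2 seconds with Numba's JIT (side note: I hate the adjacent function; any tips for making it less messy while keeping it Numba-compatible would be appreciated):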

import numba
import numpy as np


@numba.njit
def adjacent(idx, shape):
    coords = []
    if len(shape) > 2:
        if idx[0] < shape[0] - 1:
            coords.append((idx[0] + 1, idx[1], idx[2]))
        if idx[0] > 0:
            coords.append((idx[0] - 1, idx[1], idx[2]))
        if idx[1] < shape[1] - 1:
            coords.append((idx[0], idx[1] + 1, idx[2]))
        if idx[1] > 0:
            coords.append((idx[0], idx[1] - 1, idx[2]))
        if idx[2] < shape[2] - 1:
            coords.append((idx[0], idx[1], idx[2] + 1))
        if idx[2] > 0:
            coords.append((idx[0], idx[1], idx[2] - 1))
    else:
        if idx[0] < shape[0] - 1:
            coords.append((idx[0] + 1, idx[1]))
        if idx[0] > 0:
            coords.append((idx[0] - 1, idx[1]))
        if idx[1] < shape[1] - 1:
            coords.append((idx[0], idx[1] + 1))
        if idx[1] > 0:
            coords.append((idx[0], idx[1] - 1))
    return coords


@numba.njit
def apply_label(labels, decoded_image, current_label, idx):
    labels[idx] = current_label
    for aidx in adjacent(idx, labels.shape):
        if decoded_image[aidx] == decoded_image[idx] and labels[aidx] == 0:
            apply_label(labels, decoded_image, current_label, aidx)


@numba.njit
def label_image(decoded_image):
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    for idx in zip(*np.where(decoded_image >= 0)):
        if labels[idx] == 0:
            current_label += 1
            apply_label(labels, decoded_image, current_label, idx)
    return labels, current_label

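This works on some data but crashes on other data. I found that the problem is that the recursion limit is reached when there are very large objects to label. I tried rewriting label_image to avoid recursion, but it now takes about 10 seconds with Numba. That is still a huge improvement over where I started, but it seems like it should be possible to get the same performance as the recursive version. Here is my iterative version: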

@numba.njit
def label_image(decoded_image):
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    for idx in zip(*np.where(decoded_image >= 0)):
        if labels[idx] == 0:
            current_label += 1
            idxs = [idx]
            while idxs:
                cidx = idxs.pop()
                if labels[cidx] == 0:
                    labels[cidx] = current_label
                    for aidx in adjacent(cidx, labels.shape):
                        if labels[aidx] == 0 and decoded_image[aidx] == decoded_image[idx]:
                            idxs.append(aidx)
    return labels, current_label

Is there any way to improve this?

Recommended answer

def label_image(decoded_image):
    shape = decoded_image.shape
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    for idx in zip(*np.where(decoded_image > 0)):
        if labels[idx] == 0:
            current_label += 1
            stack = [idx]
            while stack:
                top = stack.pop()
                labels[top] = current_label
                for i in range(0, len(shape)):
                    if top[i] > 0:
                        neighbor = list(top)
                        neighbor[i] -= 1
                        neighbor = tuple(neighbor)
                        if decoded_image[neighbor] == decoded_image[idx] and labels[neighbor] == 0:
                            stack.append(neighbor)
                    if top[i] < shape[i] - 1:
                        neighbor = list(top)
                        neighbor[i] += 1
                        neighbor = tuple(neighbor)
                        if decoded_image[neighbor] == decoded_image[idx] and labels[neighbor] == 0:
                            stack.append(neighbor)
    return labels

@numba.njit
def label_image_2d(decoded_image):
    w, h = decoded_image.shape
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    for idx in zip(*np.where(decoded_image > 0)):
        if labels[idx] == 0:
            current_label += 1
            stack = [idx]
            while stack:
                x, y = stack.pop()
                if decoded_image[x, y] != decoded_image[idx] or labels[x, y] != 0:
                    continue  # already visited or not part of this group
                labels[x, y] = current_label
                if x > 0:
                    stack.append((x-1, y))
                if x+1 < w:
                    stack.append((x+1, y))
                if y > 0:
                    stack.append((x, y-1))
                if y+1 < h:
                    stack.append((x, y+1))
    return labels


@numba.njit
def label_image_3d(decoded_image):
    w, h, l = decoded_image.shape
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    for idx in zip(*np.where(decoded_image > 0)):
        if labels[idx] == 0:
            current_label += 1
            stack = [idx]
            while stack:
                x, y, z = stack.pop()
                if decoded_image[x, y, z] != decoded_image[idx] or labels[x, y, z] != 0:
                    continue  # already visited or not part of this group
                labels[x, y, z] = current_label
                if x > 0:
                    stack.append((x-1, y, z))
                if x+1 < w:
                    stack.append((x+1, y, z))
                if y > 0:
                    stack.append((x, y-1, z))
                if y+1 < h:
                    stack.append((x, y+1, z))
                if z > 0:
                    stack.append((x, y, z-1))
                if z+1 < l:
                    stack.append((x, y, z+1))
    return labels


def label_image(decoded_image):
    dim = len(decoded_image.shape)
    if dim == 2:
        return label_image_2d(decoded_image)
    assert dim == 3
    return label_image_3d(decoded_image)

@numba.njit
def label_image_2d(decoded_image):
    w, h = decoded_image.shape
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    stack = []
    for sx in range(0, w):
        for sy in range(0, h):
            start = (sx, sy)
            image_label = decoded_image[start]
            if image_label <= 0 or labels[start] != 0:
                continue
            current_label += 1
            stack.append(start)
            while stack:
                x, y = stack.pop()
                if decoded_image[x, y] != image_label or labels[x, y] != 0:
                    continue  # already visited or not part of this group
                labels[x, y] = current_label
                if x > 0:
                    stack.append((x-1, y))
                if x+1 < w:
                    stack.append((x+1, y))
                if y > 0:
                    stack.append((x, y-1))
                if y+1 < h:
                    stack.append((x, y+1))
    return labels


@numba.njit
def label_image_3d(decoded_image):
    w, h, l = decoded_image.shape
    labels = np.zeros_like(decoded_image, dtype=np.uint32)
    current_label = 0
    stack = []
    for sx in range(0, w):
        for sy in range(0, h):
            for sz in range(0, l):
                start = (sx, sy, sz)
                image_label = decoded_image[start]
                if image_label <= 0 or labels[start] != 0:
                    continue
                current_label += 1
                stack.append(start)
                while stack:
                    x, y, z = stack.pop()
                    if decoded_image[x, y, z] != image_label or labels[x, y, z] != 0:
                        continue  # already visited or not part of this group
                    labels[x, y, z] = current_label
                    if x > 0:
                        stack.append((x-1, y, z))
                    if x+1 < w:
                        stack.append((x+1, y, z))
                    if y > 0:
                        stack.append((x, y-1, z))
                    if y+1 < h:
                        stack.append((x, y+1, z))
                    if z > 0:
                        stack.append((x, y, z-1))
                    if z+1 < l:
                        stack.append((x, y, z+1))
    return labels
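
The stack-based versions above label a cell when it is popped, so the same cell can be pushed several times before it is visited. A common flood-fill variant labels a cell at the moment it is pushed, so each cell enters the stack at most once. The sketch below is plain Python on nested lists (no Numba, not benchmarked), and the names `fill_label_on_pop` and `fill_label_on_push` are hypothetical. It only counts pushes to illustrate the difference; whether this helps under Numba would have to be measured.

```python
NEIGHBORS = ((-1, 0), (1, 0), (0, -1), (0, 1))


def fill_label_on_pop(grid, labels, start, label):
    """Flood fill that labels a cell when it is popped. Returns the push count."""
    rows, cols = len(grid), len(grid[0])
    value = grid[start[0]][start[1]]
    stack = [start]
    pushes = 1
    while stack:
        y, x = stack.pop()
        if grid[y][x] != value or labels[y][x] != 0:
            continue  # already visited or not part of this group
        labels[y][x] = label
        for dy, dx in NEIGHBORS:
            ny, nx = y + dy, x + dx
            if 0 <= ny < rows and 0 <= nx < cols:
                stack.append((ny, nx))  # may push a cell that is already queued
                pushes += 1
    return pushes


def fill_label_on_push(grid, labels, start, label):
    """Flood fill that labels a cell when it is pushed. Returns the push count."""
    rows, cols = len(grid), len(grid[0])
    value = grid[start[0]][start[1]]
    labels[start[0]][start[1]] = label
    stack = [start]
    pushes = 1
    while stack:
        y, x = stack.pop()
        for dy, dx in NEIGHBORS:
            ny, nx = y + dy, x + dx
            if (0 <= ny < rows and 0 <= nx < cols
                    and labels[ny][nx] == 0 and grid[ny][nx] == value):
                labels[ny][nx] = label  # mark now so it is never pushed again
                stack.append((ny, nx))
                pushes += 1
    return pushes


size = 50
grid = [[5] * size for _ in range(size)]  # one large group, the worst case here
labels_pop = [[0] * size for _ in range(size)]
labels_push = [[0] * size for _ in range(size)]
pushes_pop = fill_label_on_pop(grid, labels_pop, (0, 0), 1)
pushes_push = fill_label_on_push(grid, labels_push, (0, 0), 1)
print("pushes when labeling on pop: ", pushes_pop)
print("pushes when labeling on push:", pushes_push)
```

Both functions produce the same labels for the group; only the amount of stack traffic differs.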

import numpy as np
import timeit
import rec
import iter_old
import iter_new
import merge

shape = (100, 100, 100)
n = 20

for module in [rec, iter_old, iter_new, merge]:
    print(module)

    label_image = module.label_image
    # Trigger compilation of 2d & 3d functions
    label_image(np.zeros((1, 1)))
    label_image(np.zeros((1, 1, 1)))

    i = 0
    def test_full():
        global i
        i += 1
        label_image(np.full(shape, i))
    print("single group:", timeit.timeit(test_full, number=n))
    print("random (few groups):", timeit.timeit(
        lambda: label_image(np.random.randint(low=1, high=10, size=shape)),
        number=n))
    print("random (many groups):", timeit.timeit(
        lambda: label_image(np.random.randint(low=1, high=400, size=shape)),
        number=n))
    print("only groups:", timeit.timeit(
        lambda: label_image(np.arange(np.prod(shape)).reshape(shape)),
        number=n))

<module 'rec' from '...'>
single group: 32.39212468900041
random (few groups): 14.648884047001047
random (many groups): 13.304533919001187
only groups: 13.513677138000276
<module 'iter_old' from '...'>
single group: 10.287227957000141
random (few groups): 17.37535468200076
random (many groups): 14.506630064999626
only groups: 13.132202609998785
<module 'iter_new' from '...'>
single group: 7.388022166000155
random (few groups): 11.585243002000425
random (many groups): 9.560101995000878
only groups: 8.693653742000606
<module 'merge' from '...'>
single group: 14.657021331999204
random (few groups): 14.146574055999736
random (many groups): 13.412314713001251
only groups: 12.642367746000673

<module 'iter_old' from '...'>
single group: 3.5357716739999887
random (few groups): 4.931695729999774
random (many groups): 3.4671142009992764
only groups: 3.3023930709987326
<module 'iter_new' from '...'>
single group: 2.45903080700009
random (few groups): 2.907660342001691
random (many groups): 2.309699692999857
only groups: 2.052835552000033
<module 'merge' from '...'>
single group: 3.7620838259990705
random (few groups): 3.3524249689999124
random (many groups): 3.126650959999097
only groups: 2.9456547739991947
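
Timings like these are only comparable if the implementations produce the same grouping. Because different traversal orders may number the groups differently, a check should compare labelings up to renaming of the labels. A minimal sketch, using a hypothetical helper `same_grouping` that works on flat sequences of labels (a NumPy label array could be passed as `labels.ravel()`), and assuming that 0 means background in both labelings:

```python
def same_grouping(labels_a, labels_b):
    """Return True if two flat label sequences describe the same partition.

    Label numbers may differ; only which cells share a label matters.
    Assumption: 0 means background in both inputs and must coincide.
    """
    if len(labels_a) != len(labels_b):
        return False
    a_to_b = {}
    b_to_a = {}
    for a, b in zip(labels_a, labels_b):
        if (a == 0) != (b == 0):
            return False  # background in one labeling but not in the other
        # Each label in A must map to exactly one label in B, and vice versa.
        if a_to_b.setdefault(a, b) != b or b_to_a.setdefault(b, a) != a:
            return False
    return True


print(same_grouping([0, 1, 1, 2, 2], [0, 2, 2, 1, 1]))  # True: labels swapped
print(same_grouping([0, 1, 1, 2, 2], [0, 1, 1, 1, 1]))  # False: groups merged
```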
