我正在try 设置一个指标,以判断新申请何时导致旧申请被拒绝.
如果personal_id内的任何rejected_time在creation_timestamp后5分钟内发生,则由于新应用程序而已被拒绝.基于此,我应该创建如示例中所示的列"new_app_causes_rejecting".
个人ID有数十万个,大多数都有多个应用程序ID,并且应用程序ID内的行数各不相同.
personal_id | application_id | creation_timestamp | approved_amount | rejected_time | new_application_causes_rejection |
---|---|---|---|---|---|
5a | 694f | 2023-01-24 13:01:07.939534 | 8000.0 | 2023-01-24 13:13:15.499000 | 0 |
5a | 694f | 2023-01-24 13:01:07.939534 | 8000.0 | 2023-01-24 14:38:02.359000 | 1 |
5a | 694f | 2023-01-24 13:01:07.939534 | 8000.0 | 2023-01-24 14:37:18.616000 | 1 |
5a | 694f | 2023-01-24 13:01:07.939534 | NaN | 2023-01-24 13:03:59.626000 | 0 |
5a | 43fa | 2023-01-24 14:36:08.287521 | NaN | 2023-01-24 14:37:22.096000 | 0 |
5a | 43fa | 2023-01-24 14:36:08.287521 | 13000.0 | 2023-01-24 14:39:31.750000 | 1 |
5a | 43fa | 2023-01-24 14:36:08.287521 | 13000.0 | 2023-02-02 08:42:26.980106 | 1 |
5a | 43fa | 2023-01-24 14:36:08.287521 | NaN | 2023-01-24 14:37:22.948214 | 0 |
5a | a4b6 | 2023-01-24 14:38:42.625969 | 5000.0 | 2023-02-02 08:42:26.980106 | 0 |
5a | a4b7 | 2023-01-24 14:38:42.625969 | NaN | 2023-01-24 14:38:46.922000 | 0 |
5a | a4b8 | 2023-01-24 14:38:42.625969 | 8000.0 | 2023-02-02 08:42:26.980106 | 0 |