我确实有一个问题希望—我确信你能帮我.假设我确实有两个框架,每个框架都有多列,但为了简单起见,让我们关注每帧一列. Important:Both frame are different in size, with A being shorter
import pandas as pd
FrameA=pd.DataFrame({"A":["00281378554", "10862520000","82540193700","76015394900","00134355050","21864009"]})
FrameB=pd.DataFrame({"A":["AT511634000134355050","AT411513000281378554", "AT711509100151013992",
"AT511509000121340020","AT424480010862520000","AT742011182540193700","AT531200076015394900","HU02142201082186400900000000"
]})
我的目标如下:我想判断帧A中列A的每个元素,如果从帧B中列A中的it is contained,如果是,那么我想在帧A中创建一个新列(较短的维度),结果如下:
Frame A
Col A Col B
00281378554 AT411513000281378554
10862520000 AT424480010862520000
82540193700 AT742011182540193700
76015394900 AT531200076015394900
00134355050 AT511634000134355050
21864009 HU02142201082186400900000000
因此,frameA的新列B将包含frameB中的元素,该元素包含来自frameB的字符串,
我try 了np.where
,但由于frameA的尺寸比frameB短,所以它不工作.所以我能做的就是使用for循环.但这是非常耗时的,我认为必须有一个更优雅的解决方案
有什么建议吗?