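txt = """293 PACKAGE(S)x000D PRINT HEAD ITEM:KA02033-E844 A5:INVOICE:FIT-2401-01 HS CODE:84732:100 KNITTED FABRIC HS CODE:6006.2:2.00 INV#:TSTEX0124-009(TC-240021:) WOVEN TWILL FABRIC HS 600600D CODE:6505.00:.90 KNITTED FABRIC HS 600600D PRINT HEAD ITEM:KA02033-E844 A5:INVOICE:FIT-2401-01 HS CODE:6006.2:100 KNITTED FABRIC HS 6000021 CODE:6505.00:.90 KNITTED FABRIC HS 600600D
CONTAINER DRYU9124108:13 PACKAGE(S),AUTOMATIC GEAR CUTTING MACHINE--CYJ500-1000 HS CODE:84597010; """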
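I want to get the strings that come before each "HS CODE: digits" marker, excluding the "HS CODE: some digits" part itself. For example: ['293 PACKAGE(S)x000D PRINT HEAD ITEM:KA02033-E844 A5:INVOICE:FIT-2401-01', 'KNITTED FABRIC', 'WOVEN TWILL FABRIC', 'KNITTED FABRIC', 'KNITTED FABRIC', 'FABRIC:P57101(T989,100% POLYESTER,WIDTH:H 152 cm)', 'CONTAINER DRYU9124108:13 PACKAGE(S),AUTOMATIC GEAR CUTTING MACHINESD-CYJ500-1000']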
I tried this in Python:
import re
txt = """293 PACKAGE(S)x000D PRINT HEAD ITEM:KA02033-E844 A5:INVOICE:FIT-2401-01 HS CODE:84732:100 KNITTED FABRIC HS CODE:6006.2:2.00 INV#:TSTEX0124-009(TC-240021:) WOVEN TWILL FABRIC HS 600600D CODE:6505.00:.90 KNITTED FABRIC HS 600600D PRINT HEAD ITEM:KA02033-E844 A5:INVOICE:FIT-2401-01 HS CODE:6006.2:100 KNITTED FABRIC HS 6000021 CODE:6505.00:.90 KNITTED FABRIC HS 600600D
CONTAINER DRYU9124108:13 PACKAGE(S),AUTOMATIC GEAR CUTTING MACHINE--CYJ500-1000 HS CODE:84597010;"""
x = re.findall(r'(.+?)(?:(?=hs code:\S*\d*)|(?=hs.code:\S*\d*))', txt, re.S | re.IGNORECASE)
print(x)
Result: ['293 PACKAGE(S)x000D\nPRINT HEAD ITEM:KA02033-E844 A5:INVOICE:FIT-240021-01', 'HS CODE:6006.2:2.00 INV#:TSTEX0124-009(TC-2033:) WOVEN TWILL CAP', 'HS CODE:6505.00:0.90 KK WIDTH:H 152 cm) HS CODE:54075200_x000D_\n(*)PRO:IL:IMPORT-SHA@ZHL.CN USCI:913101141:32276439L(**)USCI:\nCONTAINER DRYU9124108:13 PACKAGE(S)"AUTOMATIC GEAR CUTTING MACHINE-CYJ500-1000"']
The result still includes the "HS CODE: digits" parts, which I want to leave out.
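One likely cause: a lookahead such as (?=hs code:...) only checks that the marker follows and does not consume it, so the next match starts right at "HS CODE:" and carries it along. A possible way around this is to split on the marker instead of matching up to it. The sketch below is only an illustration under assumptions: the sample string is a shortened, hypothetical stand-in for the real data, the helper name split_before_hs_codes is made up, and the pattern assumes each code is "HS CODE:" followed by digits and dots.

```python
import re

# Hypothetical, shortened sample in the same shape as the data in the question.
sample = (
    "PRINT HEAD ITEM:KA02033-E844 A5 INVOICE:FIT-2401-01 HS CODE:84597010 "
    "KNITTED FABRIC HS CODE:6505.00 "
    "WOVEN TWILL CAP hs code:54075200\n"
    "CONTAINER DRYU9124108:13 PACKAGE(S) HS CODE:84597010;"
)

# Assumed marker shape: "HS CODE:" (any case, optional spaces) plus digits/dots.
HS_CODE = re.compile(r"hs\s*code\s*:\s*\d[\d.]*", re.IGNORECASE)


def split_before_hs_codes(text):
    """Return the text chunks found between HS CODE markers, markers removed."""
    chunks = HS_CODE.split(text)  # split() consumes the marker itself
    cleaned = (chunk.strip(" ;\r\n\t") for chunk in chunks)
    return [chunk for chunk in cleaned if chunk]  # drop empty leftovers


print(split_before_hs_codes(sample))
```

If the real codes have another shape (for example letters mixed in), the HS_CODE pattern would need to be adjusted to match them.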