The URL of the Excel file is: https://www.gso.gov.vn/wp-content/uploads/2024/03/IIP-ENG.xlsx
I have this code:
from datetime import datetime, timedelta
url = 'https://www.gso.gov.vn/wp-content/uploads/' + datetime.strftime(datetime.now() - timedelta(30), '%y') +'/' + datetime.strftime(datetime.now() - timedelta(30), '%m') + '/IIP-ENG.xlsx'
import requests
resp = requests.get(url, verify=False)
output = open('IIP.xlsx', 'wb')
output.write(resp.content)
output.close()
I can see that a file gets downloaded, but I cannot open it in Office Excel; it reports the file as corrupted.
resp
<Response [404]>
I also cannot open it with this code:
import pandas as pd
df = pd.read_excel(open('IIP.xlsx', 'rb'),sheet_name=0, engine='openpyxl')
print(df.head(5))
It raises a BadZipFile error: File is not a zip file.
How can I solve this problem?
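For reference, a minimal sketch of one likely cause, not a confirmed fix: the working URL contains the four-digit year (2024), while '%y' in strftime produces the two-digit year (24), so the request appears to hit a path that does not exist. The 404 body is then saved as IIP.xlsx, which would explain both the Excel error and BadZipFile. The helper names build_url and download are hypothetical, and whether a file is published for any given month is an assumption.

```python
from datetime import datetime, timedelta

BASE = 'https://www.gso.gov.vn/wp-content/uploads/'


def build_url(now=None):
    """Build the IIP-ENG.xlsx URL for the date 30 days before `now`."""
    target = (now or datetime.now()) - timedelta(days=30)
    # %Y is the four-digit year (2024); %y would give the two-digit year (24).
    return BASE + target.strftime('%Y/%m') + '/IIP-ENG.xlsx'


def download(url, path='IIP.xlsx'):
    """Save the file at `url` to `path`, failing loudly on an HTTP error."""
    import requests  # imported here so build_url works without requests installed

    # verify=False is carried over from the code in the question.
    resp = requests.get(url, verify=False, timeout=30)
    # Raises requests.HTTPError on a 404 instead of writing the error page to disk.
    resp.raise_for_status()
    with open(path, 'wb') as output:
        output.write(resp.content)
    return path
```

Calling download(build_url()) and then pd.read_excel('IIP.xlsx', sheet_name=0, engine='openpyxl') would be the intended usage. If the request still fails, printing build_url() and comparing it with the known URL should show where the two differ.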