I have a string like this:
"The dates are from 30 June 2019 to 1 January 2022 inclusive"
I want to extract the dates from this string using spaCy.
Here is my function so far:
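import spacy

# Assumption: the small English model; the post does not say which model is loaded.
nlp = spacy.load("en_core_web_sm")

def extract_dates_with_year(text):
    doc = nlp(text)
    dates_with_year = []
    # Keep only the entities that spaCy labels as dates
    for ent in doc.ents:
        if ent.label_ == "DATE":
            dates_with_year.append(ent.text)
    return dates_with_year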
This returns the following output:
['30 June 2019 to 1 January 2022']
However, I would like the output to be:
['30 June 2019', '1 January 2022']
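
One possible direction, as a minimal sketch rather than a definitive fix: since spaCy returns the whole range as a single DATE entity, the entity text could be split on the connecting word afterwards. The helper names `split_date_range` and `split_all` and the list of connector words ("to", "until", "through") are assumptions made for illustration; they are not part of spaCy.

```python
import re

# Assumed connector words that join two dates in a range; extend as needed.
RANGE_SEPARATOR = re.compile(r"\s+(?:to|until|through)\s+", flags=re.IGNORECASE)

def split_date_range(date_text):
    """Split one DATE entity's text into its individual dates."""
    parts = RANGE_SEPARATOR.split(date_text)
    return [part.strip() for part in parts if part.strip()]

def split_all(date_texts):
    """Flatten a list of DATE entity texts into individual dates."""
    return [date for text in date_texts for date in split_date_range(text)]

# Applied to the output of extract_dates_with_year shown above:
print(split_all(["30 June 2019 to 1 January 2022"]))
```

This only post-processes the strings, so it would not help when the range uses a connector outside the assumed list, and it does not check that each piece is actually a valid date.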