我有一个网站,里面有这样的HTML struct :
<div class="ui-rectframe">
<p class="ui-li-desc"></p>
<h4 class="ui-li-heading">Qualifications</h4>
MBBS (University of Singapore, Singapore) 1978
<br>
MCFP (Family Med) (College of Family Physicians, Singapore) 1984
<br>
Dip Geriatric Med (NUS, Singapore) 2012
<br>
GDPM (NUS, Singapore) 2015
<br>
<h4 class="ui-li-heading">Type of first registration / date</h4>
Full Registration (14/06/1979)<br>
<h4 class="ui-li-heading">Type of current registration / date</h4>
Full Registration (14/06/1979)<br>
<h4 class="ui-li-heading">Practising Certificate Start Date</h4>
01/01/2022<br>
<h4 class="ui-li-heading">Practising Certificate End Date</h4>
31/12/2023<br>
<p></p><br>
</div>
我需要提取资格-- [ 'MBBS (University of Singapore, Singapore) 1978', 'MCFP (Family Med) (College of Family Physicians, Singapore) 1984', 'Dip Geriatric Med (NUS, Singapore) 2012', 'GDPM (NUS, Singapore) 2015' ]
我如何使用CSS Select 器或XPath来实现这一点?我可以提取父div中的所有文本项,但不能将资格与其他值(如首次注册的类型等)分开.