我正在try 迭代URL列表,并使用请求和BeatifulSoup来提取每个URL的标题名称.
但我一直收到这样的错误:
请求.例外.无效架构:找不到"[‘https://reddit.com/?feed=home’,‘https://reddit.com/chunkCSS/CollectionCommentsPage~CommentsPage~CountryPage~Frontpage~GovernanceReleaseNotesModal~ModListing~Mod~e3d63e32.74eb929a3827c754ba25_.css’,‘https://reddit.com/chunkCSS/CountryPage~Frontpage~ModListing~Multireddit~ProfileComments~ProfileOverview~ProfilePosts~Subreddit.e72fce90a7f3165091b9_.css’,‘https://reddit.com/chunkCSS/Frontpage.85a25b7700617eafa94b_.css’,‘https://reddit.com/?feed=home’,‘https://reddit.com/r/popular/’,]的连接适配器"
《守则》:
pages = []
for admin_login_pages in domains:
with open("urls.txt", "w") as f:
f.write(admin_login_pages)
if "admin" in admin_login_pages:
if "login" in admin_login_pages:
pages.append(admin_login_pages)
with open("urls.txt", "r") as fread:
url_list = [x.strip() for x in fread.readlines()]
r = requests.get(str(url_list))
soup = BeautifulSoup(r.content, 'html.parser')
for title in soup.find_all('title'):
print(f"{admin_login_pages} - {title.get_text()}")
if not pages:
print(f"{Fore.RED} No admin or login pages Found")
else:
for page_list in pages:
print(f"{Fore.GREEN} {page_list}")