near_me
Linear Algebra
keyboard_arrow_down 54 guides
chevron_leftAttribute Cookbook
check_circle
Mark as learned thumb_up
0
thumb_down
0
chat_bubble_outline
0
Comment auto_stories Bi-column layout
settings
Finding all links in Beautiful Soup
schedule Aug 11, 2023
Last updated local_offer
Tags Python●Beautiful Soup
tocTable of Contents
expand_more Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!
Start your free 7-days trial now!
To find all links (i.e. elements with the a
tag) in Beautiful Soup, use the find_all
method.
Example
Just as an example consider the following HTML document:
my_html = """ <html> <a href="www.abc.com">Site A</a> <a href="www.def.com">Site B</a> <a>Site C</a> </html>"""
soup = BeautifulSoup(my_html)
Use the find_all(tag_name)
like so:
If you want valid links that contain an url, specify href=True
:
Notice how Site C was omitted since it does not contain the href
attribute.
To just extract the URLs:
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
thumb_up
0
thumb_down
0
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!