near_me
Linear Algebra
keyboard_arrow_down 54 guides
chevron_leftDocumentation
Method findMethod find_parentMethod find_parentsMethod insertMethod insert_afterMethod insert_beforeMethod new_tagMethod prettifyMethod extendMethod appendMethod has_attrMethod find_allMethod select_oneMethod selectMethod unwrapMethod wrapMethod replace_withProperty previous_elementsProperty next_elementsProperty parentProperty previous_elementProperty next_siblingsProperty previous_siblingsProperty stringsProperty stripped_stringsProperty contentsProperty childrenProperty next_elementMethod extractMethod find_all_nextProperty previous_siblingMethod find_previous_siblingMethod find_next_siblingProperty next_siblingProperty stringMethod find_nextMethod clearProperty descendantsProperty parentsMethod get_textMethod decompose
check_circle
Mark as learned thumb_up
1
thumb_down
0
chat_bubble_outline
0
Comment auto_stories Bi-column layout
settings
Beautiful Soup | extract method
schedule Aug 12, 2023
Last updated local_offer
Tags Python●Beautiful Soup
tocTable of Contents
expand_more Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!
Start your free 7-days trial now!
In Beautiful Soup, the extract
method removes either:
an element and its string
a string
The method returns whatever it removed.
Examples
Consider the following HTML document:
my_html = """ <div> <p>Bob is great</p> </div>"""soup = BeautifulSoup(my_html)
Removing a tag and its content
To remove the p
tag as well as its inner content:
Here, we see <p>Bob is great</p>
as the output because the extract()
method returns whatever you've removed.
After calling extract()
on the p
tag, it is no longer part of our parse tree:
print(div_tag)
<div></div>
However, the catch is that, although the p
tag has been detached from our parse tree, it still exists in memory, and we can still access it:
print(p_tag)
<p>Bob is great</p>
If you want to completely remove the p_tag
, then call the decompose()
method instead of extract()
.
Removing a string
To remove the inner text of p
:
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
Official Beautiful Soup Documentation
https://www.crummy.com/software/BeautifulSoup/bs4/doc/#extract
thumb_up
1
thumb_down
0
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!