Supriya Ghosh (Editor)

Beautiful Soup (HTML parser)

Original author(s): Leonard Richardson
Platform: Python
Written in: Python
Stable release: 4.5.1 (August 2, 2016)
Repository: code.launchpad.net/beautifulsoup/
Type: HTML parser library, web scraping

Beautiful Soup is a Python package for parsing HTML and XML documents, including documents with malformed markup such as unclosed tags (the package is named after "tag soup"). It builds a parse tree for each parsed page, which can then be used to extract data from the HTML. This makes it useful for web scraping.
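
As a rough illustration of extracting data from the parse tree, the sketch below parses a small, deliberately malformed snippet and pulls out its links. It assumes the package is installed under its usual import name `bs4` and uses Python's built-in `html.parser` backend. The sample markup and the variable names are made up for this example.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup: the <p> and <li> tags are never closed.
html = """
<html><body>
<p class="intro">Hello, <b>world</b>
<ul>
<li><a href="/a">First</a>
<li><a href="/b">Second</a>
</ul>
</body></html>
"""

# Build the parse tree using the standard-library parser backend.
soup = BeautifulSoup(html, "html.parser")

# Collect the text and target of every <a> element, in document order.
links = [(a.get_text(), a["href"]) for a in soup.find_all("a")]

# Look up the first <b> element and read its text.
bold_text = soup.find("b").get_text()

print(links)
print(bold_text)
```

The exact shape of the tree built from malformed input can differ between parser backends, so the example only searches for elements by tag name and does not depend on how the unclosed tags end up nested.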

It is available for Python 2.6+ and Python 3.

References

Beautiful Soup (HTML parser) Wikipedia