Original author(s) | ||
Stable release 4.5.1 / August 2, 2016; 6 months ago (2016-08-02) Repository code.launchpad.net/beautifulsoup/ Type HTML parser library, Web scraping |
Beautiful Soup is a Python package for parsing HTML and XML documents (including having malformed markup, i.e. non-closed tags, so named after tag soup). It creates a parse tree for parsed pages that can be used to extract data from HTML, which is useful for web scraping.
It is available for Python 2.6+ and Python 3.
References
Beautiful Soup (HTML parser) Wikipedia(Text) CC BY-SA