Libraries for Python Web Scraping:
Requests
- using it to get the raw ingredients (i.e. raw HTML)
BeautifulSoup
- a parsing library that can use different parsers. A parser is simply a program that can extract data from HTML and XML documents.
lxml
- a high-performance, production-quality HTML and XML parsing library
Selenium
- to scrape sites with data tucked away by JavaScript.
- if you need to build a real spider or web-crawler