get_soup
- get_soup(url: str, *, verify: bool = True, timeout: TimeoutHint | None = None, user_agent: str | None = None) → bs4.BeautifulSoup
Download the given web page and return it parsed with BeautifulSoup.
- Parameters:
url – The URL to download and parse with BeautifulSoup
verify – Whether SSL verification should be used. This should almost always be true; the exception is Ensembl, where verification is a known source of trouble
timeout – The number of seconds (an integer) to wait for a response. Defaults to 15 if none is given.
user_agent – A custom user-agent to set, e.g., to avoid anti-crawling mechanisms
- Returns:
A BeautifulSoup object
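
The following is a minimal sketch of how the parameters above could map onto an HTTP request. It is not the library's actual implementation. It assumes the page is fetched with `requests.get` and parsed with the built-in `html.parser` backend. The names `build_request_kwargs`, `get_soup_sketch`, and `DEFAULT_TIMEOUT` are hypothetical and exist only for illustration.

```python
from __future__ import annotations

# Default taken from the parameter description above (15 seconds).
DEFAULT_TIMEOUT = 15


def build_request_kwargs(
    *,
    verify: bool = True,
    timeout: int | None = None,
    user_agent: str | None = None,
) -> dict:
    """Hypothetical helper: turn the documented parameters into requests.get keyword arguments."""
    kwargs: dict = {
        "verify": verify,
        # Fall back to the documented default when no timeout is given.
        "timeout": DEFAULT_TIMEOUT if timeout is None else timeout,
    }
    if user_agent is not None:
        # Only send a custom User-Agent header when one was requested.
        kwargs["headers"] = {"User-Agent": user_agent}
    return kwargs


def get_soup_sketch(
    url: str,
    *,
    verify: bool = True,
    timeout: int | None = None,
    user_agent: str | None = None,
):
    """Hypothetical stand-in for get_soup; the real function may differ."""
    # Imported lazily so build_request_kwargs works without third-party packages.
    import requests
    from bs4 import BeautifulSoup

    response = requests.get(
        url,
        **build_request_kwargs(verify=verify, timeout=timeout, user_agent=user_agent),
    )
    # Assumption: fail loudly on HTTP errors rather than parsing an error page.
    response.raise_for_status()
    return BeautifulSoup(response.text, features="html.parser")
```

Under these assumptions, a call such as `get_soup_sketch("https://example.org", user_agent="my-crawler/0.1")` would send the custom header, wait at most 15 seconds, and return a BeautifulSoup object whose `title`, `find_all`, and similar members can then be used as usual.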