abstract_api.web_scraping package#
Submodules#
Module contents#
- class abstract_api.web_scraping.WebScraping(api_key: str | None = None)[source]#
Bases:
BaseService[WebScrapingResponse]AbstractAPI web scraping service.
Used to extract data from a given URL.
- Attributes:
_subdomain: Web scraping service subdomain.
- scrape(url: str, render_js: bool | None = None, block_ads: bool | None = None, proxy_country: str | None = None) WebScrapingResponse[source]#
Extracts data from the given URL.
- Args:
- url: The URL to extract the data from. Note that this parameter
should include the full HTTP Protocol (http:// or https://). If your URL has parameters, you should encode it. For example the & character would be encoded to %26.
- render_js: If True the request will render Javascript on the
target site. Note that Javascript is rendered via a Google Chrome headless browser. Defaults to False.
- block_ads: If True the request will block any advertisements it
can identify on the target site. Defaults to False.
- proxy_country: The country to make the request from.
The country should be submitted in the two letter, ISO 3166-1 alpha-2 code.
- Returns:
WebScrapingResponse representing API call response.
- class abstract_api.web_scraping.WebScrapingResponse(response: Response)[source]#
Bases:
FileResponseWeb scraping service response.