Diffbot provides AI-powered tools to extract and structure data from web pages, transforming unstructured web content into structured, linked data.
Tool to search data extracted by crawl or bulk jobs using DQL queries. Use after data extraction jobs complete to retrieve search results.
Tool to retrieve account details, including plan information and usage statistics. Use after authenticating to verify subscription and daily quota status.
Tool to automatically determine a page's content type and route it to the appropriate extraction API. Use when you have only a URL and need Diffbot to choose the right extractor.
Tool to extract information from articles, including authors, publication dates, and images. Use when you need structured metadata from a web article URL.
Tool to extract threads of content from forums, comment sections, and review pages. Use when you have identified the discussion URL and need structured discussion data from that page.
Tool to extract event details from web pages. Use when you need structured event data such as venue, date, and description.
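
The extraction tools above (analyze, article, discussion, event) each take a page URL and return structured data. As a rough sketch of what a request to one of them might look like, the Python snippet below only builds the request URL and does not send it. The base URL, the endpoint names, and the `token` and `url` query parameters are assumptions modeled on Diffbot's v3 Extract APIs, and `build_extract_url` is a hypothetical helper, so check the current Diffbot documentation before relying on any of them.

```python
from urllib.parse import urlencode

# Assumed base URL and endpoint names, modeled on Diffbot's v3 Extract APIs;
# confirm against the current Diffbot documentation before relying on them.
BASE_URL = "https://api.diffbot.com/v3"
EXTRACT_APIS = {"analyze", "article", "discussion", "event"}


def build_extract_url(api: str, page_url: str, token: str) -> str:
    """Return a GET URL asking the given extraction API to process page_url.

    Hypothetical helper for illustration; it builds the URL but sends nothing.
    """
    if api not in EXTRACT_APIS:
        raise ValueError(f"unknown extraction API: {api!r}")
    # urlencode percent-encodes the target URL so it survives as a query value.
    query = urlencode({"token": token, "url": page_url})
    return f"{BASE_URL}/{api}?{query}"


if __name__ == "__main__":
    # "analyze" lets the service pick the extractor when only a URL is known.
    print(build_extract_url("analyze", "https://example.com/post?id=1", "YOUR_TOKEN"))
```

Passing "analyze" matches the case where only a URL is known, while "article", "discussion", or "event" would be the choice when the page type is already clear. The returned string can be handed to any HTTP client.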