Why was Topic unable to pull an outline from a competitor?

Although we have put a lot of effort into extracting as many outlines as possible from competitor content, Topic is occasionally unable to pull in an outline from a page.

Here are a few reasons why this happens:

  1. Not Enough Text Content: Some pages don't have enough text content to create an outline. Because we rely on heading tags (h1, h2, h3, etc.) to build the outline, a page with little text content, or with few or no headings, will result in no outline.
  2. Blocked Page: Some websites block third parties from programmatically accessing the contents of their site. When that happens, Topic can't read the page, so there is nothing to build an outline from.
  3. Unstructured Content: Many parts of a web page aren't part of its main content, such as the navigation, footer, sidebar, and advertisements. When building the outline, Topic tries to filter out these extraneous elements. If the page has a structure that confuses our system, however, Topic might not be able to identify the content and produce an outline.
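
To make the reasons above more concrete, here is a minimal Python sketch of heading-based outline extraction. It is an illustration only, not Topic's actual implementation: the names `OutlineParser`, `extract_outline`, and `SKIP_TAGS` are hypothetical, and the choice to skip only `nav`, `footer`, and `aside` is an assumption made for the example.

```python
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
# Assumption for this sketch: these containers hold non-content elements
# (navigation, footer, sidebar). A real filter would need to be smarter.
SKIP_TAGS = {"nav", "footer", "aside"}


class OutlineParser(HTMLParser):
    """Collects (level, text) pairs for headings outside skipped containers."""

    def __init__(self):
        super().__init__()
        self.outline = []
        self._skip_depth = 0   # how many skipped containers we are inside
        self._current = None   # heading tag currently being read, if any
        self._buffer = []      # text fragments of the current heading

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in HEADING_TAGS and self._skip_depth == 0:
            self._current = tag
            self._buffer = []

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == self._current:
            # Collapse runs of whitespace inside the heading text.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.outline.append((int(tag[1]), text))
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self._buffer.append(data)


def extract_outline(html):
    """Return a list of (heading level, heading text) found in the HTML."""
    parser = OutlineParser()
    parser.feed(html)
    parser.close()
    return parser.outline
```

In this sketch, a page made up only of paragraphs returns an empty list (reason 1), and headings inside a `nav` or `footer` element are dropped as extraneous (reason 3). A page that places its real content inside one of the skipped containers would also come back empty, which is the kind of structural confusion described above.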
