ReadTheDocs Documentation
Read the Docs is an open-sourced free software documentation hosting platform. It generates documentation written with the
Sphinx
documentation generator.
This notebook covers how to load content from HTML that was generated as part of aRead-The-Docs
build.
For an example of this in the wild, seehere.
This assumes that the HTML has already been scraped into a folder. This can be done by uncommenting and running the following command
%pip install--upgrade--quiet beautifulsoup4
#!wget -r -A.html -P rtdocs https://python.langchain.com/en/latest/
from langchain_community.document_loadersimport ReadTheDocsLoader
API Reference:ReadTheDocsLoader
loader= ReadTheDocsLoader("rtdocs")
docs= loader.load()
Related
- Document loaderconceptual guide
- Document loaderhow-to guides