Movatterモバイル変換


[0]ホーム

URL:


Skip to main content
OurBuilding Ambient Agents with LangGraph course is now available on LangChain Academy!
Open on GitHub

PullMd Loader

PullMd is a service that converts web pages into Markdown format. Thelangchain-pull-md package utilizes this service to convert URLs, especially those rendered with JavaScript frameworks like React, Angular, or Vue.js, into Markdown without the need for local rendering.

Installation and Setup

To get started withlangchain-pull-md, you need to install the package via pip:

pip install langchain-pull-md

See theusage example for detailed integration and usage instructions.

Document Loader

ThePullMdLoader class inlangchain-pull-md provides an easy way to convert URLs to Markdown. It's particularly useful for loading content from modern web applications for use within LangChain's processing capabilities.

from langchain_pull_mdimport PullMdLoader

# Initialize the loader with a URL of a JavaScript-rendered webpage
loader= PullMdLoader(url='https://example.com')

# Load the content as a Document
documents= loader.load()

# Access the Markdown content
for documentin documents:
print(document.page_content)

This loader supports any URL and is particularly adept at handling sites built with dynamic JavaScript, making it a versatile tool for markdown extraction in data processing workflows.

API Reference

For a comprehensive guide to all available functions and their parameters, visit theAPI reference.

Additional Resources


[8]ページ先頭

©2009-2025 Movatter.jp