
An SEO tool that analyzes the structure of a site, crawls the site, counts words in the body of the site and warns of any technical SEO issues.
Requires Python 3.6+, BeautifulSoup4 and urllib3.
pip3 install pyseoanalyzer
docker run sethblack/python-seo-analyzer [ARGS ...]
If you run without a sitemap it will start crawling at the homepage.
seoanalyze http://www.domain.com/
Or you can specify the path to a sitmap to seed the urls to scan list.
seoanalyze http://www.domain.com/ --sitemap path/to/sitemap.xml
HTML output can be generated from the analysis instead of json.
seoanalyze http://www.domain.com/ --output-format html
Theanalyze
function returns a dictionary with the results of the crawl.
fromseoanalyzerimportanalyzeoutput=analyze(site,sitemap)print(output)
In order to analyze heading tags (h1-h6) and other extra additional tags as well, the following options can be passed to theanalyze
function
fromseoanalyzerimportanalyzeoutput=analyze(site,sitemap,analyze_headings=True,analyze_extra_tags=True)print(output)
By default, theanalyze
function analyzes all the existing inner links as well, which might be time consuming.This default behaviour can be changed to analyze only the provided URL by passing the following option to theanalyze
function
fromseoanalyzerimportanalyzeoutput=analyze(site,sitemap,follow_links=False)print(output)
Alternatively, you can run the analysis as a script from the seoanalyzer folder.
python analyzer.py https://www.sethserver.com/ -f html> results.html
If you getrequests.exceptions.SSLError
at either the command-line or via the python-API, try using:
instead of..