Movatterモバイル変換


[0]ホーム

URL:


Skip to content
DEV Community
Log in Create account

DEV Community

Jennifer Fadriquela
Jennifer Fadriquela

Posted on

     

Webscraping: Beginner's Thoughts

Decided to learn webscraping this month. The first thing I did was to watch courses in Pluralsight:
Scraping Dynamic Web Pages with Python and Selenium
Scraping Your First Web Page with Python
Exploring Web Scraping with Python

Webscraping can be done by using Python libraries likeBeautifulSoup andRequests. This assumes that you have all urls predetermined and will just scrape the page source.

But if you will scrape a dynamic page (ex: a div is rendered only if a specific button was clicked) then you will need a library likeSelenium to emulate user interactions.

When I was confident with the basics, I took a step further and learned theScrapy framework. This requires a steeper learning curve than native Python libraries because you have to know the flow of how objects are passed in the framework. The main advantage is you won't have to write boilerplate codes (writing data to files, handling url requests, data modelling) redundantly because those are already integrated with its pipeline.

Here are some Scrapy Pluralsight courses that helped me:
Crawling the Web with Python and Scrapy
Extracting Structured Data from the Web Using Scrapy

Top comments(0)

Subscribe
pic
Create template

Templates let you quickly answer FAQs or store snippets for re-use.

Dismiss

Are you sure you want to hide this comment? It will become hidden in your post, but will still be visible via the comment'spermalink.

For further actions, you may consider blocking this person and/orreporting abuse

Stuff Developer
  • Location
    Manila
  • Work
    Software Engineer
  • Joined

Trending onDEV CommunityHot

DEV Community

We're a place where coders share, stay up-to-date and grow their careers.

Log in Create account

[8]ページ先頭

©2009-2025 Movatter.jp