Zyte Developers Community newsletter issue #5
Hi there,
If you are not signed up already for the Zyte Developers Community newsletter, you can sign up here.
In this issue:
- Web scraping for a COVID-19 vaccine
- Scraping Reddit with Scrapy
- Easiest way to extract data from Wikipedia
- Dateparser - Python parser for human-readable dates
- A sip of BeautifulSoup
Web scraping for a COVID-19 vaccine
Miguel MagalhĆ£es was having a hard time finding covid vaccine slots in France. So he built aĀ simple Python scriptĀ that checked the server for the next available slot, until it finds a vacancy.
Scraping Reddit with Scrapy
This one is a bit long but interesting read. Brendan Ferris usesĀ Scrapy to extract multiple fieldsĀ like username, title, content link, etc. from posts onĀ old.reddit.com.
Easiest way to extract data from Wikipedia
In this article, Elliot discusses the Wikipedia module in Python which allows you toĀ scrape data from WikipediaĀ in the easiest way possible.
Dateparser - Python parser for human-readable dates
Dateparser was developed to make date extraction from HTML pages easier. Initially, used by web scraping developers, it was quickly adopted by the wider community and has been used for multiple applications like command-line tools, chatbots, etc. This blog post coversĀ everything you need to know about date parsing.
A sip of BeautifulSoup
This article is for those among us who are just starting to get acquainted to web scraping. Runjot Kaur'sĀ introduction to BeautifulSoupĀ is very easy to understand.