Explore resources by topic or category
Browse by Category
Blog
When LLM Web Scraping isn't Enough to Scale Web Scraping
Anita Clarke
6 mins
June 12, 2024
If you’re not using AI, you’re being left behind. Ever since ChatGPT burst onto the scene, this is the message developers are constantly hearing.
Blog
Pioneering web scraping technologies and building a developer community, Part 2
Voy Zeglin
10 mins
June 10, 2024
In the second part of our in-depth interview with Voy Zeglin, we delve into the nitty-gritty of web scraping. Voy shares his expertise on how his company, Data Miners, deals with anti-scraping measures and maintains high data quality.
Blog
The trade-offs in crawling infrastructure in the modern anti-bot landscape
Daniel Cave
8 mins
April 10, 2024
In this article, I’llexplain the problem of anti-bot technology for web scraping developers through the lens of the anti-bot distribution curve (a view of the top 250,000 websites and the relative complexity of their anti-bot tech) and the landscape of anti-bot tech across the web.
Blog
What is Web Data Harvesting? 1 key description will help you
Himanshi Bhatt
2 Mins
March 26, 2024
Blog
Compliant Web Scraping with AI
Callum Henry
6 mins
March 15, 2024
Zyte’s flagship product, Zyte API, now includes built-in features that automate crawling using spider templates, and our patented AI-powered automated extraction, which gives you quality structured data quickly without writing custom parsing code.
Webinars
Exploring the Frontier of AI Scraping - A fireside chat with Zyte's Tech Leaders
February 26, 2024
Blog
Web Scraping vs Data Mining | What's the Difference?
Sarah Lang
5 Mins
February 17, 2024
Data mining and web scraping – sounds like two buzzwords meaning the same thing. Quite often data mining is misunderstood as the process of obtaining information from a website;
Blog
The challenges e-commerce retailers face managing their web scraping proxies
Ian Kerins
7 min
February 16, 2024
In this article we discuss some main challenges that e-commerce retailers face on a daily basis due to the amount of web data needed and how to solve them.
Blog
Court Rules Meta's Terms Do Not Prohibit Scraping of Public Data
Sanaea Daruwalla
7 mins
January 30, 2024
In 2023, Meta sued Bright Data for scraping data from Facebook and Instagram, alleging that its scraping breached Facebook and Instagram’s terms of service and is thus a breach of contract.
Blog
Celebrating Ethics in Web Scraping With the EWDCI Certification
Sanaea Daruwalla
4 mins
November 21, 2023
The reputation of web scraping hasn’t always been the best. Unsavory actors have cast a shadow over the reputable parts of the web scraping industry at large, and it has to stop.