From 9b8f935f2cbd5f8546b967ef59a62c50ddca0be1 Mon Sep 17 00:00:00 2001
From: Filipe Filardi
Date: Sun, 22 Apr 2018 18:17:56 -0300
Subject: [PATCH] Improve the concept of web crawling

---
 README.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index ed279a4..1f19510 100644
--- a/README.md
+++ b/README.md
@@ -85,7 +85,7 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
     - [URL Manipulation](#url-manipulation)
     - [Video](#video)
     - [Web Content Extracting](#web-content-extracting)
-    - [Web Crawling](#web-crawling)
+    - [Web Crawling & Web Scraping](#web-crawling-&-web-scraping)
     - [Web Frameworks](#web-frameworks)
     - [WebSocket](#websocket)
     - [WSGI Servers](#wsgi-servers)
@@ -1203,9 +1203,9 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
 * [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word, PowerPoint, PDFs, etc.
 * [toapi](https://github.com/gaojiuli/toapi) - Every web site provides APIs.
 
-## Web Crawling
+## Web Crawling & Web Scraping
 
-*Libraries for scraping websites.*
+*Libraries to automate data extraction from websites.*
 
 * [cola](https://github.com/chineking/cola) - A distributed crawling framework.
 * [Demiurge](https://github.com/matiasb/demiurge) - PyQuery-based scraping micro-framework.