mirror of
https://github.com/vinta/awesome-python
synced 2024-11-15 00:07:24 +00:00
add textract to Web Content Extracting section
This commit is contained in:
parent
a7942efbbf
commit
d52ea10139
1 changed files with 1 additions and 0 deletions
|
@ -565,6 +565,7 @@ A curated list of awesome Python frameworks, libraries and software. Inspired by
|
|||
* [Haul](https://github.com/vinta/Haul) - An Extensible Image Crawler.
|
||||
* [python-readability](https://github.com/buriy/python-readability) - Fast Python port of arc90's readability tool.
|
||||
* [opengraph](https://github.com/erikriver/opengraph) - A Python module to parse the Open Graph Protocol
|
||||
* [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word documents, PowerPoint presentations, PDFs, etc.
|
||||
|
||||
## Forms
|
||||
|
||||
|
|
Loading…
Reference in a new issue