mirror of
https://github.com/ArchiveBox/ArchiveBox
synced 2024-11-22 20:23:12 +00:00
310b4d1242
Saves HTML text nodes and selected element attributes in `htmltotext.txt` for each Snapshot. Primarily intended to be used for search indexing. |
||
---|---|---|
.. | ||
__init__.py | ||
archive_org.py | ||
dom.py | ||
favicon.py | ||
git.py | ||
headers.py | ||
htmltotext.py | ||
media.py | ||
mercury.py | ||
pdf.py | ||
readability.py | ||
screenshot.py | ||
singlefile.py | ||
title.py | ||
wget.py |