ArchiveBox/archivebox/extractors
2024-09-25 05:10:09 -07:00
__init__.py     move almost all config into new archivebox.CONSTANTS  2024-09-25 05:10:09 -07:00
archive_org.py  load EXTRACTORS dynamically using importlib.import_module  2024-05-11 22:28:59 -07:00
dom.py          move pdf, screenshot, dom, singlefile, and ytdlp extractor config to new plugin system  2024-09-25 00:42:26 -07:00
favicon.py      config and attr access improvements  2024-08-20 18:31:21 -07:00
git.py          config and attr access improvements  2024-08-20 18:31:21 -07:00
headers.py      load EXTRACTORS dynamically using importlib.import_module  2024-05-11 22:28:59 -07:00
htmltotext.py   move almost all config into new archivebox.CONSTANTS  2024-09-25 05:10:09 -07:00
media.py        move pdf, screenshot, dom, singlefile, and ytdlp extractor config to new plugin system  2024-09-25 00:42:26 -07:00
mercury.py      load EXTRACTORS dynamically using importlib.import_module  2024-05-11 22:28:59 -07:00
pdf.py          move pdf, screenshot, dom, singlefile, and ytdlp extractor config to new plugin system  2024-09-25 00:42:26 -07:00
readability.py  move almost all config into new archivebox.CONSTANTS  2024-09-25 05:10:09 -07:00
screenshot.py   move pdf, screenshot, dom, singlefile, and ytdlp extractor config to new plugin system  2024-09-25 00:42:26 -07:00
singlefile.py   move pdf, screenshot, dom, singlefile, and ytdlp extractor config to new plugin system  2024-09-25 00:42:26 -07:00
title.py        load EXTRACTORS dynamically using importlib.import_module  2024-05-11 22:28:59 -07:00
wget.py         rename datetime fields to _at, massively improve ABID generation safety and determinism  2024-09-04 23:42:36 -07:00