ArchiveBox/archivebox/parsers
Latest commit: 2024-04-24 19:41:31 -07:00
File | Last commit message | Last commit date
__init__.py | replace uses of URL_REGEX with find_all_urls to handle markdown better | 2024-04-24 17:45:45 -07:00
generic_html.py | fix urljoin bug causing multiple slashes to be merged into one | 2024-04-24 19:41:31 -07:00
generic_json.py | Add generic_jsonl parser | 2024-03-14 15:42:29 -07:00
generic_jsonl.py | Add generic_jsonl parser | 2024-03-14 15:42:29 -07:00
generic_rss.py | Use feedparser for RSS parsing in generic_rss and pinboard_rss parsers | 2024-03-01 11:25:45 -08:00
generic_txt.py | replace uses of URL_REGEX with find_all_urls to handle markdown better | 2024-04-24 17:45:45 -07:00
medium_rss.py | use KEY, NAME, and PARSER to define parsers instead of hardcoding in init | 2021-03-31 01:05:49 -04:00
netscape_html.py | use KEY, NAME, and PARSER to define parsers instead of hardcoding in init | 2021-03-31 01:05:49 -04:00
pinboard_rss.py | Use feedparser for RSS parsing in generic_rss and pinboard_rss parsers | 2024-03-01 11:25:45 -08:00
pocket_api.py | fix typo in pocket_api articl variable name | 2021-11-12 19:23:47 -05:00
pocket_html.py | use KEY, NAME, and PARSER to define parsers instead of hardcoding in init | 2021-03-31 01:05:49 -04:00
readwise_reader_api.py | Fix readwise token | 2023-10-29 17:27:04 -04:00
shaarli_rss.py | use KEY, NAME, and PARSER to define parsers instead of hardcoding in init | 2021-03-31 01:05:49 -04:00
url_list.py | add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support | 2021-04-10 04:21:36 -04:00
wallabag_atom.py | handle new wallabag export format with newlines mid-tag attributes | 2022-05-09 19:07:48 -07:00