Commit graph

718 commits

Author SHA1 Message Date
Angel Rey
01461a98a7 Replaced os.path in logging_util.py 2020-10-02 15:46:39 -05:00
Angel Rey
25ac18c8b7 Replaced os.path in system.py 2020-10-02 15:46:39 -05:00
Angel Rey
16b5ca3207 Replaced os.path in init config 2020-10-02 15:46:39 -05:00
Angel Rey
897bace84d Fixed paths in settings 2020-10-02 15:46:39 -05:00
Angel Rey
0e7c337dcb Replaced os.path in settings.py 2020-10-02 15:46:39 -05:00
Angel Rey
ce71747538 replaced os.path in init extractors 2020-10-02 15:46:39 -05:00
Angel Rey
fa364ed728 Replaced od.path in init cli 2020-10-02 15:46:39 -05:00
Angel Rey
3fb410a604 Replaced os.path in favicon.py 2020-10-02 15:46:39 -05:00
Angel Rey
ad04fb5300 Replaced os.path in init index 2020-10-02 15:46:39 -05:00
Angel Rey
78f7062761 Replaced os.path in html.py 2020-10-02 15:46:39 -05:00
Angel Rey
8b03c37fbb Replaced os.path in json.py 2020-10-02 15:46:39 -05:00
Angel Rey
9264ad88e0 Fixed string casting 2020-10-02 15:46:39 -05:00
Angel Rey
7d513b9b19 Replaced os.path in schema.py 2020-10-02 15:46:39 -05:00
Angel Rey
2c62abb270 Replaced os.path in init parsers 2020-10-02 15:46:39 -05:00
ttimasdf
eda3836dee feat: add og:title metadata as alternative title 2020-09-27 12:54:52 +08:00
Cristian
5975c27a6a fix: Remove trailing slash from public index 2020-09-25 13:48:19 -05:00
Cristian
abde871a3c fix: Wget absolute path generating issues 2020-09-25 08:24:06 -05:00
Angel Rey
4581ea956f Fixed empty tags 2020-09-24 15:34:23 -05:00
Angel Rey
533ae7413c Removed comments 2020-09-24 15:34:23 -05:00
Angel Rey
e06d3f9128 Fixed Link schema 2020-09-24 15:34:23 -05:00
Angel Rey
45775c607c Fixed empty tags 2020-09-24 15:34:23 -05:00
Angel Rey
f26c0c6cd8 Fix serialization 2020-09-24 15:34:23 -05:00
Angel Rey
62c9028212 Improved tags 2020-09-24 15:34:23 -05:00
Cristian
7d3767b882 fix: oneshot command not running extractors 2020-09-24 12:56:16 -05:00
Cristian
62ed11a5ca fix: Improve headers handling 2020-09-24 12:55:51 -05:00
Angel Rey
a40af98ced removed static file check 2020-09-24 12:55:51 -05:00
Angel Rey
f0915a56aa Replaced get method 2020-09-24 12:55:51 -05:00
Cristian
e0939d7fe4 fix: Syntax issue on config module 2020-09-24 08:48:58 -05:00
Nick Sweeting
a7cd01ad4f
Merge pull request #480 from apkallum/master 2020-09-23 17:30:11 -04:00
Nick Sweeting
38c1f96e2c
Update archivebox/config/__init__.py 2020-09-23 17:29:57 -04:00
Karim
2b987421fb
simpler check for CHROME_USER_DATA_DIR 2020-09-23 17:23:53 -04:00
apkallum
508984c941 fix: ensure chrome data dir is none when appropiate 2020-09-23 13:22:10 -04:00
Angel Rey
dc160daba8 Fixed lint 2020-09-23 11:07:00 -05:00
Angel Rey
7fd7dced9a Added curl params 2020-09-23 11:07:00 -05:00
Angel Rey
a8a8fd14ac Fixed indent headers.json 2020-09-23 11:07:00 -05:00
Angel Rey
852e3c9cff Added headers extractor 2020-09-23 11:07:00 -05:00
Cristian
eb34a6af62 lint: Fix mercury extractor lint issues 2020-09-23 10:35:39 -05:00
Cristian
46b9e3d536 fix: Fix mercury extractor test 2020-09-23 10:34:05 -05:00
ttimasdf
2bf496e7e9 feat: Add mercury-parsed content to summary page 2020-09-22 18:44:12 -05:00
ttimasdf
357b677363 fix: add mercury-parser to extractors list 2020-09-22 18:44:12 -05:00
ttimasdf
706bd895e0 feat: Add mercury-parser 2020-09-22 18:44:12 -05:00
Cristian
b18bbf8874 test: Fix tests post-rebase 2020-09-17 09:09:52 -05:00
apkallum
422664079a fix test type casting for folder['path'] 2020-09-17 09:09:52 -05:00
apkallum
0144f19227 fix github action folder listing 2020-09-17 09:09:52 -05:00
apkallum
1aa7bac85b fix oneshot command type signature 2020-09-17 09:09:52 -05:00
apkallum
95157427c2 update stubs file 2020-09-17 09:09:52 -05:00
apkallum
008769d296 add support for Paths in json encoder 2020-09-17 09:09:52 -05:00
apkallum
abf68e5437 no home() in Paths 2020-09-17 09:09:52 -05:00
apkallum
b99784b919 pathlib with / syntax for config, index 2020-09-17 09:09:52 -05:00
apkallum
594d9e49ce first attempt to migrate to Pathlib 2020-09-17 09:09:52 -05:00
Cristian
b2ed96c35a feat: Redirect old add view to the main one 2020-09-17 09:08:20 -05:00
Cristian
b3ec170e39 fix: Remove unused imports 2020-09-16 08:50:56 -05:00
Cristian
bc116c25f8 refactor: Change View to FormView 2020-09-16 08:50:56 -05:00
apkallum
a06bd715a9 remove reference to old home 2020-09-16 08:50:56 -05:00
apkallum
1cdaad00a8 no more oldhome, cbvs uniform across views 2020-09-16 08:50:56 -05:00
apkallum
94a590b31a factor out a base.html template 2020-09-16 08:50:56 -05:00
apkallum
5e8c115f3f unify public archive view 2020-09-16 08:50:56 -05:00
apkallum
3288f8579b add public add view + toggle setting 2020-09-16 08:50:56 -05:00
apkallum
6f7cc2b3ef ensure results have icons 2020-09-16 08:50:56 -05:00
apkallum
3048c0f6dc add icons to new public view 2020-09-16 08:50:56 -05:00
apkallum
c50af04cce search view inherits from modified public view 2020-09-16 08:50:56 -05:00
apkallum
948b2469f6 no files count in public view 2020-09-16 08:50:56 -05:00
apkallum
5c4ac3cf3d new public view derived from django 2020-09-16 08:50:56 -05:00
Cristian
50f3f16203 lint: Remove unused import 2020-09-15 08:05:46 -05:00
Cristian
0a83392cbf fix: Replace any typing with Union[Iterable[Link], QuerySet] in archive_links 2020-09-15 08:05:46 -05:00
Cristian
779a446085 feat: Make title and tags editable in admin 2020-09-15 08:05:46 -05:00
Cristian
5348f4735a fix: Change check to avoid issues with empty querysets 2020-09-15 08:05:46 -05:00
Cristian
cf18130f85 feat: Add deprecation warning for index.json 2020-09-15 08:05:46 -05:00
Cristian
018bd91745 refactor: Remove get_iter lambda from archive_links 2020-09-15 08:05:46 -05:00
Cristian Vargas
5e9b3099c6 Update fix_duplicate_links_in_index docstring
Co-authored-by: Nick Sweeting <git@sweeting.me>
2020-09-15 08:05:46 -05:00
Cristian
01fb44fd40 refactor: Change archive_links check to focus on queryset, so it allows other iterables and not just lists 2020-09-15 08:05:46 -05:00
Cristian
fa622d3e14 refactor: Replace --index with --with-headers in the list command to make it more explicit. Change it so it affects the csv output too. 2020-09-15 08:05:46 -05:00
Cristian
2aa8d69b72 fix: Save history in main index (to mimic previous behaviour) 2020-09-15 08:05:46 -05:00
Cristian
7e9d195d13 feat: Update list command to sort using sqlite 2020-09-15 08:05:46 -05:00
Cristian
f55153eab3 feat: Update update command to work with querysets 2020-09-15 08:05:46 -05:00
Cristian
fe9604a772 feat: Add tests for remove command 2020-09-15 08:05:46 -05:00
Cristian
a8ed72501d feat: Refactor remove command to use querysets 2020-09-15 08:05:46 -05:00
Cristian
be520d137a feat: Refactor add method to use querysets 2020-09-15 08:05:46 -05:00
Cristian
6a2e6aad2f fix: status command was failing on empty archives 2020-09-15 08:05:46 -05:00
Cristian
be0dff8126 feat: Add tests to refactored init command 2020-09-15 08:05:46 -05:00
Cristian
404f333e17 feat: Refactor get_invalid_folders to work with a queryset instead of a list of links 2020-09-15 08:05:46 -05:00
Cristian
dae606de6e feat: Update init to take advantage of querysets to reduce memory consumption 2020-09-15 08:05:46 -05:00
Cristian
6b4b7127b4 feat: Remove unused imports 2020-09-15 08:05:46 -05:00
Cristian
b8585dd92e feat: load_main_index returns a queryset now 2020-09-15 08:05:46 -05:00
Cristian
a77d6dc235 feat: list command fails when --index is used without --json or --html 2020-09-15 08:05:46 -05:00
Cristian
885ff50449 feat: Add html export to list command 2020-09-15 08:05:46 -05:00
Cristian
aab8f96520 feat: Add flag to list command to support index like output 2020-09-15 08:05:46 -05:00
Cristian
be57db1369 feat: Save static indexes at the end of init 2020-09-15 08:05:46 -05:00
Cristian
c16fdf1b47 feat: Update data folder check 2020-09-15 08:05:46 -05:00
Cristian
874403e667 feat: Remove patch_main_index 2020-09-15 08:05:46 -05:00
Cristian
31343c1367 feat: Update extractors and add command to use sql index as source of truth 2020-09-15 08:05:46 -05:00
Cristian
e9caee6b10 feat: Update status command to consider sql as the main index 2020-09-15 08:05:46 -05:00
Cristian
02f36b2096 feat: Replace index.json with index.sql as the main index in init 2020-09-15 08:05:46 -05:00
Cristian
bd3c824d45 fix: Escape JSON output on command failure so the user can run the command manually 2020-09-04 10:23:41 -05:00
Nick Sweeting
a645f36b87
add comment about fake cmd 2020-09-01 19:42:22 -04:00
Cristian
66037535fd feat: Add curl command on readability as default command to debug 2020-09-01 10:16:24 -05:00
Cristian
bf3ea42141 fix: Add a default cmd value to handle case where the html cannot be retrieved 2020-08-27 09:51:33 -05:00
Nick Sweeting
d179264cb7 dont warn about chrome twice 2020-08-18 20:08:04 -04:00
Nick Sweeting
a2c158e43e catch OSErrors due to missing path 2020-08-18 19:09:45 -04:00
Nick Sweeting
4428134073 fix version parsing bug 2020-08-18 19:09:45 -04:00