Nick Sweeting
|
c9b3bab84d
|
fix pull title not working
|
2020-08-18 08:49:26 -04:00 |
|
Nick Sweeting
|
f18d92570e
|
wip attempt to fix timestamp unique constraint errors
|
2020-08-18 08:30:09 -04:00 |
|
Nick Sweeting
|
b0c0a676f8
|
re-enable readability and singlefile by default now that its less noisy
|
2020-08-18 08:29:46 -04:00 |
|
Nick Sweeting
|
15efb2d5ed
|
new generic_html parser for extracting hrefs
|
2020-08-18 08:29:05 -04:00 |
|
Nick Sweeting
|
a682a9c478
|
make all parsers accept arbitrary meta kwargs
|
2020-08-18 08:27:47 -04:00 |
|
Nick Sweeting
|
2e2b4f8150
|
fix url is too long to be a path error
|
2020-08-18 08:23:57 -04:00 |
|
Nick Sweeting
|
e3ac4c2405
|
htmldecode downloaded sources before parsing for links
|
2020-08-18 08:23:20 -04:00 |
|
Nick Sweeting
|
fc0d064ddf
|
add hints
|
2020-08-18 08:22:34 -04:00 |
|
Nick Sweeting
|
9d093a34f8
|
better dependency version check
|
2020-08-18 08:21:55 -04:00 |
|
Nick Sweeting
|
2f282402c7
|
add hint and prefix stdout utils
|
2020-08-18 08:21:36 -04:00 |
|
Nick Sweeting
|
2557a0da04
|
tweak cli output coloring
|
2020-08-18 08:16:18 -04:00 |
|
Nick Sweeting
|
d7d53cfb12
|
dont show skipped extractors to reduce visual noise
|
2020-08-18 08:13:35 -04:00 |
|
Nick Sweeting
|
5f84a7bc6e
|
better handle the case where json index lags behind sql index
|
2020-08-18 08:13:13 -04:00 |
|
Nick Sweeting
|
77d2f08a5c
|
show more info in merge conflict error message
|
2020-08-18 08:12:35 -04:00 |
|
Nick Sweeting
|
718d39e242
|
add common code extensions to default blacklist
|
2020-08-18 08:12:10 -04:00 |
|
Nick Sweeting
|
3658153cf8
|
fix url parsing through quotes
|
2020-08-18 08:04:57 -04:00 |
|
Nick Sweeting
|
a3155b88a1
|
better error message when nothing left to resume
|
2020-08-18 04:42:39 -04:00 |
|
Nick Sweeting
|
f8eb245519
|
better archivebox schedule CLI UX
|
2020-08-18 04:39:58 -04:00 |
|
Nick Sweeting
|
ec5609a09a
|
fix resume flag on archivebox update
|
2020-08-18 04:39:39 -04:00 |
|
Nick Sweeting
|
12f3977da5
|
fix source filenames missing domain
|
2020-08-18 04:39:29 -04:00 |
|
Nick Sweeting
|
ec1cce283e
|
include failed link updates in total count stats
|
2020-08-18 04:39:19 -04:00 |
|
Nick Sweeting
|
6e142e3531
|
match existing snapshots on url only, ignoring timestamp
|
2020-08-18 04:39:04 -04:00 |
|
Nick Sweeting
|
f371032b71
|
show warning when killing archivebox during index writing
|
2020-08-18 04:38:29 -04:00 |
|
Nick Sweeting
|
92de20af15
|
better detect missing dependencies on startup
|
2020-08-18 04:38:13 -04:00 |
|
Nick Sweeting
|
b681a477ae
|
add overwrite flag to add command to force re-archiving
|
2020-08-18 04:37:54 -04:00 |
|
Nick Sweeting
|
da671532a4
|
fix schedule cmd defaulting to day and remove week
|
2020-08-18 04:35:59 -04:00 |
|
Nick Sweeting
|
84a8b0d713
|
hide progress bars on mac
|
2020-08-18 03:27:52 -04:00 |
|
Nick Sweeting
|
29c8da83d6
|
0.4.16 release
|
2020-08-18 02:08:52 -04:00 |
|
Nick Sweeting
|
7c16944a44
|
Merge pull request #446 from cdvv7788/hotfix/#445
|
2020-08-18 02:06:32 -04:00 |
|
Nick Sweeting
|
7638dc45ea
|
0.4.15 release
|
2020-08-18 01:59:50 -04:00 |
|
Nick Sweeting
|
235eb20dbd
|
support cron in docker
|
2020-08-18 01:59:04 -04:00 |
|
Nick Sweeting
|
494be09bc2
|
add depth flag to schedule cmd
|
2020-08-18 01:58:54 -04:00 |
|
Cristian
|
05c71fc302
|
fix: Organize readability extractor so a timeout does not break the whole process
|
2020-08-17 08:34:40 -05:00 |
|
Nick Sweeting
|
225b63b732
|
skip invalid urls at all stages
|
2020-08-17 03:12:17 -04:00 |
|
Nick Sweeting
|
429f39dec1
|
0.4.14 release
|
2020-08-14 13:13:50 -04:00 |
|
Nick Sweeting
|
58e928520a
|
tweak log output for skipped methods
|
2020-08-14 13:12:50 -04:00 |
|
Nick Sweeting
|
03b73bfe77
|
Update archivebox/extractors/readability.py
|
2020-08-14 12:55:22 -04:00 |
|
Nick Sweeting
|
050b717bb9
|
Merge branch 'master' into readability-extractor
|
2020-08-14 12:35:35 -04:00 |
|
Nick Sweeting
|
0ef2b17678
|
only show data locations in version output when in a data dir
|
2020-08-13 23:21:57 -04:00 |
|
Nick Sweeting
|
a0901ba474
|
use BIND_ADDR config default for runserver
|
2020-08-13 23:21:37 -04:00 |
|
Cristian
|
b7aa3df8d2
|
feat: Disable singlefile and readability by default
|
2020-08-12 14:42:21 -05:00 |
|
Cristian
|
eb3528fa9f
|
feat: Add readability output to legacy index.html
|
2020-08-11 12:14:13 -05:00 |
|
Cristian
|
5dc7e63792
|
feat: Update dockerfile to support readability
|
2020-08-11 11:52:43 -05:00 |
|
Cristian
|
2a68af1b94
|
tests: Add readability tests
|
2020-08-11 11:15:15 -05:00 |
|
Cristian
|
8aa7b34de7
|
tests: Add readability to ignored methods in tests
|
2020-08-11 08:58:49 -05:00 |
|
Cristian
|
dc87d8b68c
|
tests: Update failing tests
|
2020-08-11 08:48:13 -05:00 |
|
Cristian
|
0ec747f64e
|
feat: Look in wget, singlefile or dom outputs before attempting to download the information again
|
2020-08-11 08:37:12 -05:00 |
|
Cristian
|
a14762640e
|
feat: Avoid running readability when the target is a file
|
2020-08-11 08:37:12 -05:00 |
|
Cristian
|
61e08a7c43
|
docs: Update docs link
|
2020-08-11 08:37:12 -05:00 |
|
Cristian
|
b33c66a9f7
|
feat: Split output of readability into multiple files
|
2020-08-11 08:37:12 -05:00 |
|
Cristian
|
7e2b249388
|
feat: Initial version of readability extractor
|
2020-08-11 08:37:12 -05:00 |
|
apkallum
|
50069d1eb3
|
set tz variable globally as UTC
|
2020-08-10 23:21:02 -04:00 |
|
apkallum
|
e9bd0b122e
|
fix: utc timing for initial command log as well
|
2020-08-10 19:17:17 -04:00 |
|
Nick Sweeting
|
cd09d1b077
|
0.4.13 release
|
2020-08-10 14:39:06 -04:00 |
|
Nick Sweeting
|
fcbc61917e
|
0.4.12 release
|
2020-08-10 14:26:32 -04:00 |
|
Nick Sweeting
|
33ab7fd4ec
|
autodetect when running inside docker and provide hints
|
2020-08-10 14:18:04 -04:00 |
|
Nick Sweeting
|
f24cb3dcbe
|
add docker help text
|
2020-08-10 13:42:31 -04:00 |
|
Nick Sweeting
|
430be7bc68
|
add missing staticfile check to singlefile
|
2020-08-10 13:42:20 -04:00 |
|
Cristian
|
76846d18a0
|
docs: Improve message for missing singlefile binary
|
2020-08-10 09:00:10 -05:00 |
|
Cristian
|
e358634f81
|
fix: Add missing configuration that breaks on edge case where only single file is being used
|
2020-08-08 09:12:14 -05:00 |
|
Nick Sweeting
|
87ba82ad39
|
0.4.11 release
|
2020-08-06 23:10:59 -04:00 |
|
Nick Sweeting
|
5b8abb2dce
|
bump version
|
2020-08-06 23:10:37 -04:00 |
|
Nick Sweeting
|
19aa5c3e94
|
fix SAVE_SINGLEFILE setting to depend on chrome
|
2020-08-06 23:07:25 -04:00 |
|
Cristian
|
3c5c6a689e
|
fix: Add missing configuration variable to be able to disable singlefile
|
2020-08-04 07:35:58 -05:00 |
|
Cristian
|
06d0e9de6c
|
feat: Add support for singlefile in docker
|
2020-08-03 13:23:05 -05:00 |
|
Nick Sweeting
|
5b6eb5e4ad
|
make filenames consistent with program name
|
2020-08-03 13:23:05 -05:00 |
|
Cristian
|
91f63635e8
|
feat: Add singlefile in a couple more places
|
2020-08-03 13:22:06 -05:00 |
|
Cristian
|
b325c0dd9f
|
feat: Add singlefile to latest outputs
|
2020-08-03 13:22:06 -05:00 |
|
Cristian
|
a40e337280
|
feat: Add link to admin list of files
|
2020-08-03 13:22:06 -05:00 |
|
Cristian
|
42b0c80465
|
feat: Add singlefile to link_details
|
2020-08-03 13:22:06 -05:00 |
|
Nick Sweeting
|
3d22da39fe
|
Update archivebox/config/__init__.py
|
2020-08-03 13:22:06 -05:00 |
|
Cristian
|
787a5ad43e
|
fix: Commit code review suggestions
|
2020-08-03 13:22:06 -05:00 |
|
Cristian
|
853685668c
|
feat: Add initial support for singlefile extractor
|
2020-08-03 13:22:06 -05:00 |
|
Nick Sweeting
|
dd916e91d0
|
Merge pull request #396 from cdvv7788/oneshot-command
|
2020-08-01 13:44:51 -04:00 |
|
Cristian
|
d0d2991c69
|
fix: Change import that was not working
|
2020-07-31 12:15:00 -05:00 |
|
Cristian Vargas
|
b2a318c5eb
|
fix: Update error message for oneshot command
Co-authored-by: Nick Sweeting <git@sweeting.me>
|
2020-07-31 10:51:54 -05:00 |
|
Cristian
|
a8c74730f8
|
docs: Add docstring to oneshot method
|
2020-07-31 10:28:30 -05:00 |
|
Cristian
|
e6c571beb2
|
fix: Remove title from extractors for oneshot
|
2020-07-31 10:24:58 -05:00 |
|
Cristian
|
8bcb171e74
|
fix: Remove support for multiple urls in oneshot command
|
2020-07-31 09:05:40 -05:00 |
|
Nick Sweeting
|
5707ffe657
|
fix old config name FETCH_TITLE
|
2020-07-30 16:55:24 -04:00 |
|
Nick Sweeting
|
a160e6bf20
|
fix None canon output to be emptystring
|
2020-07-29 23:54:50 -04:00 |
|
Nick Sweeting
|
9dedcdd577
|
remove inaccurate updated ts from main index UI
|
2020-07-29 23:54:50 -04:00 |
|
Cristian
|
3afb2401bc
|
fix: Add condition to avoid breaking the add command
|
2020-07-29 11:53:49 -05:00 |
|
Cristian
|
c073ea141d
|
feat: Initial oneshot command proposal
|
2020-07-29 11:19:06 -05:00 |
|
Nick Sweeting
|
c1f21880f3
|
0.4.9 release
|
2020-07-28 08:25:01 -04:00 |
|
Nick Sweeting
|
3c7966c13a
|
dont get bin path when bin is missing
|
2020-07-28 07:20:57 -04:00 |
|
Nick Sweeting
|
1b96c582a7
|
fix lint and improve docker-compose instructions
|
2020-07-28 07:18:10 -04:00 |
|
Nick Sweeting
|
9248ff5890
|
0.4.8 release
|
2020-07-28 06:52:44 -04:00 |
|
Nick Sweeting
|
acc697e73c
|
0.4.7 release
|
2020-07-28 06:51:18 -04:00 |
|
Nick Sweeting
|
9806ed8d8c
|
fix circular import
|
2020-07-28 06:50:03 -04:00 |
|
Nick Sweeting
|
301e220c53
|
v0.4.6
|
2020-07-28 06:22:24 -04:00 |
|
Nick Sweeting
|
b8c93889c1
|
hide prints and tweak url text in titlebar
|
2020-07-28 06:03:52 -04:00 |
|
Nick Sweeting
|
b1082cfbaa
|
ui and css improvements
|
2020-07-28 06:00:09 -04:00 |
|
Nick Sweeting
|
5a30e03778
|
rearrange tags column and improve files icons
|
2020-07-28 05:59:54 -04:00 |
|
Nick Sweeting
|
2e0b751376
|
accept methods argument to filder archive_link
|
2020-07-28 05:58:38 -04:00 |
|
Nick Sweeting
|
032c2458de
|
add missing setup_django import
|
2020-07-28 05:58:13 -04:00 |
|
Nick Sweeting
|
9e7330cc14
|
add init flag to server and fix SHOW_PROGRESS config being ignored
|
2020-07-28 05:57:34 -04:00 |
|
Nick Sweeting
|
55a237a435
|
also set snapshot title inside of fetch_title directly
|
2020-07-28 05:56:34 -04:00 |
|
Nick Sweeting
|
273059f054
|
accept gzipped responses when using curl
|
2020-07-28 05:55:54 -04:00 |
|
Nick Sweeting
|
af9084ee95
|
update Snapshot.title to latest_title after fetching
|
2020-07-28 05:55:09 -04:00 |
|