| Author | Commit | Message | Date |
|---|---|---|---|
| Nick Sweeting | 61ec4971e9 | fix nested archive index page and improve wget output finding | 2019-03-19 18:10:11 -04:00 |
| Nick Sweeting | eb5cc8078a | better UX before titles have been fetched during archiving progress | 2019-03-19 18:10:11 -04:00 |
| Nick Sweeting | 914750c453 | better title regex to match titles surrounded by newlines | 2019-03-19 18:10:11 -04:00 |
| Nick Sweeting | 1b5201fd58 | re-save index on Ctrl+c to hide in progress message on html output | 2019-03-19 18:10:11 -04:00 |
| noncetonic | 28758cf16c | Adds CHROME_USER_AGENT | 2019-03-19 10:15:52 -07:00 |
| noncetonic | e230e27929 | Changes HEADLESS_USER_AGENT to CHROME_USER_AGENT | 2019-03-19 08:13:27 -07:00 |
| noncetonic | a13f22d15a | Adds support for HEADLESS_USER_AGENT for Chrome | 2019-03-19 05:32:48 -07:00 |
| Nick Sweeting | 1c1bc76ac1 | add chrome headless option and improve default data dir finding | 2019-03-12 17:50:10 -04:00 |
| Nick Sweeting | 5e583573d5 | pretty warning when missing distutils | 2019-03-12 15:50:41 -04:00 |
| Nick Sweeting | 8319ccf064 | add docs link to config.py | 2019-03-12 12:48:46 -04:00 |
| Nick Sweeting | 10bb970d66 | Update archive_methods.py | 2019-03-12 12:45:33 -04:00 |
| Nick Sweeting | c474bb7992 | fix settings not being applied | 2019-03-11 03:13:59 -04:00 |
| Nick Sweeting | 32c39d0fd0 | cleaner output dir spec in config | 2019-03-08 17:51:49 -05:00 |
| Nick Sweeting | 2e10f57f6e | fix relative links from index files | 2019-03-08 17:46:14 -05:00 |
| Nick Sweeting | ce13a57a2c | fix favicon not existing | 2019-03-08 17:30:36 -05:00 |
| Nick Sweeting | 0f84c40f69 | dont use latest to override derived info | 2019-03-08 17:29:32 -05:00 |
| Nick Sweeting | 450b4534ad | actually fix path | 2019-03-08 17:10:18 -05:00 |
| Nick Sweeting | 5c401007d3 | fix output path | 2019-03-08 17:05:53 -05:00 |
| Nick Sweeting | 83a96bb823 | fix missing vars | 2019-03-08 17:03:48 -05:00 |
| Nick Sweeting | c7fc9e1878 | remove dead code and cleanup utils file | 2019-03-08 17:01:15 -05:00 |
| Nick Sweeting | 354ea142e7 | fix double path in archive_url | 2019-03-08 16:31:25 -05:00 |
| Nick Sweeting | a74d8410f4 | also check for macOS binary defaults | 2019-03-08 16:25:42 -05:00 |
| Nick Sweeting | b2ccb7dbcb | refactor error hint printing to be DRYer | 2019-03-08 16:25:42 -05:00 |
| Nick Sweeting | 14e66a6909 | use derived_link_info more consistently in index generation | 2019-03-03 14:10:18 -05:00 |
| Nick Sweeting | 552734241b | put git clones in a git folder to avoid retries | 2019-02-27 15:55:39 -05:00 |
| Nick Sweeting | 3eaa76267e | fix keyerror domain bug | 2019-02-27 15:42:53 -05:00 |
| Nick Sweeting | b4dc80b0a7 | when checking link invariants, check for regex match as well | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | fa6f53f2af | used derived info for all derivable info | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | ef4c446c8b | new compiled URL regex with better markdown support | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | b2c22a73e6 | move parsers to global instead of func | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | af7e8df0eb | rename download url func | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | 09d79e55a0 | remove derivable link info from links | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | eb003f6a26 | better function naming | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | 328a59749b | better remote file downloading cli output messages | 2019-02-27 04:50:29 -05:00 |
| Nick Sweeting | b03e9fade8 | better link corruption guards, remove title prefetching, save index after run | 2019-02-21 17:45:28 -05:00 |
| Nick Sweeting | c95632883e | fix without_hash to without_fragment | 2019-02-21 16:03:19 -05:00 |
| Nick Sweeting | d689264365 | add new config and dependency options | 2019-02-21 15:47:15 -05:00 |
| Nick Sweeting | 34cfde0c4e | dont fetch titles when FETCH_TITLE=False | 2019-02-21 12:58:51 -05:00 |
| Nick Sweeting | d52c9c5304 | allow passing COOKIES_FILE to wget | 2019-02-21 12:58:51 -05:00 |
| Nick Sweeting | 935dcac0c7 | fix title showing up as None in some UI spots | 2019-02-19 02:40:02 -05:00 |
| Nick Sweeting | eff0100971 | fix RSS parser bailing out when lines have whitespace before tags | 2019-02-19 02:31:53 -05:00 |
| Nick Sweeting | 3571ef24e4 | fix logic for ONLY_NEW accidentally replacing all links | 2019-02-19 02:21:28 -05:00 |
| Nick Sweeting | 1b36d5b29c | fix pocket timestamps defaulting to now | 2019-02-19 01:54:09 -05:00 |
| Nick Sweeting | 2c9aad559d | use urllib for url parsing instead of hand written string commands | 2019-02-19 01:45:19 -05:00 |
| Nick Sweeting | 8576a2f061 | better parser explanation comment | 2019-02-19 01:45:03 -05:00 |
| Nick Sweeting | 5a7d00a639 | fetch page title during archiving process | 2019-02-19 01:44:54 -05:00 |
| Nick Sweeting | bb5879a4f7 | fix some parser errors not being caught by bail out process | 2019-02-18 23:45:49 -05:00 |
| Nick Sweeting | 74b99fe9eb | fix import_path None error | 2019-02-12 20:04:03 -05:00 |
| Nick Sweeting | e6d5cd4432 | ignore robots.txt when using wget | 2019-02-06 22:06:36 -08:00 |
| Nick Sweeting | 56d382235f | better progress output | 2019-02-06 22:06:36 -08:00 |