Commit graph

32 commits

Author SHA1 Message Date
Nick Sweeting
14e66a6909 use derived_link_info more consistently in index generation 2019-03-03 14:10:18 -05:00
Nick Sweeting
b4dc80b0a7 when checking link invariants, check for regex match as well 2019-02-27 04:50:29 -05:00
Nick Sweeting
fa6f53f2af used derived info for all derivable info 2019-02-27 04:50:29 -05:00
Nick Sweeting
ef4c446c8b new compiled URL regex with better markdown support 2019-02-27 04:50:29 -05:00
Nick Sweeting
09d79e55a0 remove derivable link info from links 2019-02-27 04:50:29 -05:00
Nick Sweeting
eb003f6a26 better function naming 2019-02-27 04:50:29 -05:00
Nick Sweeting
328a59749b better remote file downloading cli output messages 2019-02-27 04:50:29 -05:00
Nick Sweeting
b03e9fade8 better link corruption guards, remove title prefetching, save index after run 2019-02-21 17:45:28 -05:00
Nick Sweeting
d689264365 add new config and dependency options 2019-02-21 15:47:15 -05:00
Nick Sweeting
34cfde0c4e dont fetch titles when FETCH_TITLE=False 2019-02-21 12:58:51 -05:00
Nick Sweeting
935dcac0c7 fix title showing up as None in some UI spots 2019-02-19 02:40:02 -05:00
Nick Sweeting
2c9aad559d use urllib for url parsing instead of hand written string commands 2019-02-19 01:45:19 -05:00
Nick Sweeting
5a7d00a639 fetch page title during archiving process 2019-02-19 01:44:54 -05:00
Nick Sweeting
56d382235f better progress output 2019-02-06 22:06:36 -08:00
Nick Sweeting
1ca27a60b2 show dots as progress bar for titles fetching 2019-02-04 22:07:59 -08:00
Nick Sweeting
ca8f57ef5c add dedicated shaarli rss parser 2019-02-04 21:18:42 -08:00
root
fb00f1f336 fix flickering progress bars 2019-02-05 04:45:36 +00:00
Nick Sweeting
bc6de29a5a recreate chunks on each new progress bar 2019-02-04 20:41:35 -08:00
Nick Sweeting
3ac0efb619 dont parse quotes as part of urls 2019-02-04 18:54:02 -08:00
Nick Sweeting
c37941efd1 dont match closing tags in full text 2019-01-31 19:46:34 -08:00
Nick Sweeting
2c30cd9468 better closure 2019-01-23 01:08:23 -05:00
Nick Sweeting
db0461ec4e add timeout to title fetching 2019-01-23 01:08:23 -05:00
Nick Sweeting
ce07a6c738 patch subprocess.run to have better timeout handling 2019-01-20 14:08:33 -05:00
Nick Sweeting
e620722bb9 fix timeouts hanging the progressbar 2019-01-20 14:08:33 -05:00
Nick Sweeting
7ea36c4adb bump timeouts and improve curl archive method 2019-01-20 14:08:33 -05:00
Nick Sweeting
39f5e12364 allow passing import list via stdin 2019-01-14 18:11:48 -05:00
Nick Sweeting
6189b1a8af fix invalid old config vars 2019-01-12 03:13:55 -05:00
Nick Sweeting
d35c6cf8b5 fix fetch page title default 2019-01-11 05:19:22 -05:00
Nick Sweeting
67d103a293 dont float round timestamps 2019-01-11 05:19:07 -05:00
Nick Sweeting
cf9d1875c7 add plain text link parsing 2019-01-11 04:09:39 -05:00
Nick Sweeting
0e72addcda remove requests dependency 2019-01-09 04:05:34 -05:00
Nick Sweeting
57d42339a4 rename pip dir archive to archivebox 2018-12-31 20:53:01 -05:00
Renamed from archiver/util.py (Browse further)