50 commits

Author SHA1 Message Date
Nick Sweeting
bb65b2dbec
move almost all config into new archivebox.CONSTANTS 2024-09-25 05:10:09 -07:00
Nick Sweeting
cbf2a8fdc3
rename datetime fields to _at, massively improve ABID generation safety and determinism 2024-09-04 23:42:36 -07:00
Nick Sweeting
d060eaa499
abid gradual improvements, some regrets 2024-09-04 00:08:14 -07:00
Nick Sweeting
24fe958ff3
massively improve Snapshot admin list view query performance 2024-08-26 20:16:43 -07:00
Nick Sweeting
09553d8340
hardcode EXTRACTOR_CHOICES to prevent nondeterministic migrations 2024-08-22 15:36:02 -07:00
Nick Sweeting
0420662174
switch everywhere to use Snapshot.pk and ArchiveResult.pk instead of id 2024-05-13 05:12:12 -07:00
Nick Sweeting
457c42bf84
load EXTRACTORS dynamically using importlib.import_module 2024-05-11 22:28:59 -07:00
Ross Williams
310b4d1242 Add htmltotext extractor
Saves HTML text nodes and selected element attributes in
`htmltotext.txt` for each Snapshot. Primarily intended to be used
for search indexing.
2023-10-23 21:42:32 -04:00
DanielBatteryStapler
94dacc49c7
Fix archive_org icon "exists" 2023-08-15 23:49:54 -04:00
Nick Sweeting
8ebf3e2f93 add config option PREVIEW_ORIGINALS to hide original iframes in snapshot detail pages 2022-05-09 19:31:41 -07:00
Nick Sweeting
a9986f1f05 add timezone support, tons of CSS and layout improvements, more detailed snapshot admin form info, ability to sort by recently updated, better grid view styling, better table layouts, better dark mode support 2021-04-10 04:21:36 -04:00
Nick Sweeting
59d5423483 fix snapshot icon caching and ordering 2021-04-01 02:22:15 -04:00
Nick Sweeting
8b236b9367 cache dir size, snapshot icons, tags str, and title in django cache 2021-02-16 15:49:29 -05:00
Nick Sweeting
988a10a9f6 fix warc path in snapshot_icons 2021-02-16 06:18:05 -05:00
Nick Sweeting
78463c243a remove unused GIT_SHA config option 2021-02-15 20:42:33 -05:00
Nick Sweeting
534ead2440 use the db exclusively for icons instead of hammering filesystem 2021-02-01 02:18:13 -05:00
Nick Sweeting
923f517a8f minor fixes 2021-02-01 02:17:54 -05:00
Nick Sweeting
54c5331693 check for output existance when rendering files icons 2021-01-30 22:04:14 -05:00
Nick Sweeting
15e87353bd only show archive.org if enabled 2021-01-30 22:03:59 -05:00
Nick Sweeting
cc80ceb0a2 fix icons in public index 2021-01-30 05:47:55 -05:00
Nick Sweeting
a98298103d cleanup templates and views 2021-01-30 05:47:55 -05:00
Nick Sweeting
f6c3683ab8 fix snapshot favicon loading spinner height 2021-01-29 00:15:32 -05:00
Nick Sweeting
5c54bcc1f3 fix files icons greying out on public index 2021-01-28 22:57:12 -05:00
Nick Sweeting
f0040580c8 fix files icons escaping 2021-01-28 22:27:17 -05:00
Preston Maness
1810426774 Remove now-unused mark_safe import 2021-01-25 21:16:06 -06:00
Preston Maness
b647581115
Update archivebox/index/html.py
mark_safe is dangerous, as the URL's filename could have malicious HTML fragments in it.

Co-authored-by: Nick Sweeting <git@sweeting.me>
2021-01-25 20:47:57 -06:00
Preston Maness
1989275944 Fix issue #617 by using mark_safe in combination with format_html
I have no experience with Django, so all I'm really going off of is this
stackoverflow

https://stackoverflow.com/a/64498319

which cited this bit of Django documentation:

https://docs.djangoproject.com/en/3.1/ref/utils/#django.utils.html.format_html

After using this method, I no longer get the 500 error or KeyError
exception, and can browse the local server and interact with the single
entry in it (the problematic URL in ArchiveBox#617 with curly braces).

Whether this is the "right" method or not, I have no idea. But it is at
least a start.
2021-01-23 20:32:56 -06:00
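The failure described in the commit message above (a KeyError for a URL containing curly braces) can be reproduced without Django. The sketch below is illustrative only and is not ArchiveBox's code: `render_link_unsafe` and `render_link_safe` are hypothetical names, and the standard library's `html.escape` stands in for the escaping that Django's `format_html` applies to its arguments before calling `str.format()`. It assumes the original bug came from the URL ending up inside the format string itself.

```python
from html import escape


def render_link_unsafe(url: str) -> str:
    """Hypothetical buggy version: the URL becomes part of the format string."""
    # Any "{...}" inside the URL is now parsed as a placeholder by str.format().
    template = '<a href="' + url + '">{label}</a>'
    return template.format(label="link")


def render_link_safe(url: str) -> str:
    """Hypothetical fixed version: constant template, URL passed as an argument."""
    # Arguments are escaped and substituted verbatim; braces in them are not
    # interpreted, and HTML metacharacters cannot break out of the attribute.
    return '<a href="{}">{}</a>'.format(escape(url), escape("link"))


if __name__ == "__main__":
    url = "https://example.com/{id}/page"
    try:
        render_link_unsafe(url)
    except KeyError as exc:
        print("unsafe version raised KeyError:", exc)
    print(render_link_safe(url))
```

Keeping the template constant and passing the URL as an argument addresses both concerns raised in this history: braces in the URL are no longer treated as placeholders, and escaping the argument avoids the injection risk that the later `mark_safe` removal was about.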
Cristian
57d1a3d4e5 refactor: Remove setup_django from html.py 2020-12-11 17:49:16 -05:00
Nick Sweeting
00dfe2d449
Merge branch 'v0.5.0' into cleanup 2020-12-04 20:40:24 -05:00
jdcaballerov
54b25d9a81 Linting 2020-12-03 15:59:45 -05:00
jdcaballerov
d4bca80b50 Use uppercase for constants 2020-12-03 15:44:59 -05:00
jdcaballerov
a1fba9887d Remove write_html_main_index 2020-12-03 09:25:38 -05:00
jdcaballerov
367b12ba40 Replace legacy templates for django templates 2020-12-03 09:16:18 -05:00
Cristian
7008f9b735 feat: move import 2020-11-28 13:11:15 -05:00
Nick Sweeting
a846916b82
remove unused argument 2020-11-28 12:38:15 -05:00
Cristian
fa5de72f9f refactor: Move indexing logic out of logging module 2020-11-28 12:34:40 -05:00
Nick Sweeting
bee1f3e263 fix lint errors 2020-11-28 04:09:59 -05:00
Nick Sweeting
104553489f remove redundant utils file 2020-11-28 02:12:27 -05:00
Nick Sweeting
c9162a6d09 remove finished/not finished spinners 2020-11-28 01:07:02 -05:00
Nick Sweeting
22fb9c2ad7 tweak icons 2020-10-31 19:32:43 -04:00
Nick Sweeting
cafe35c595 show pending in light font 2020-10-31 16:33:31 -04:00
Angel Rey
78f7062761 Replaced os.path in html.py 2020-10-02 15:46:39 -05:00
apkallum
594d9e49ce first attempt to migrate to Pathlib 2020-09-17 09:09:52 -05:00
Cristian
885ff50449 feat: Add html export to list command 2020-09-15 08:05:46 -05:00
Nick Sweeting
430d51756b show archive dir size in link details page 2020-08-18 09:17:37 -04:00
Nick Sweeting
3aeca0e450 fix pending titles and favicons, improve add page, custom admin 2020-07-27 23:26:45 -04:00
Cristian
f845224d6f fix: htmlencode titles before rendering the static html index and detail 2020-07-16 09:20:33 -05:00
Nick Sweeting
cb67b09f9d Merge branch 'master' into django 2020-06-25 21:30:29 -04:00
Nick Sweeting
95007d9137 split up utils into separate files 2019-04-30 23:13:04 -04:00
Nick Sweeting
1b8abc0961 move everything out of legacy folder 2019-04-27 17:26:24 -04:00
Renamed from archivebox/legacy/storage/html.py