Commit graph

3085 commits

Author SHA1 Message Date
Igor Rzegocki
05de1c9fe6
healthcheck endpoint 2021-10-03 19:12:03 +02:00
Igor Rzegocki
95cf85f8cf
Support for Reverse Proxy authentication backends (like authelia) 2021-09-30 17:40:13 +02:00
Bruno Tavares
bb2a2e758a
Avoid KeyError on Pocket API parser
When trying to import my Pocket library I got a lot of `KeyError`s in Python. The Pocket API has a few idiosyncrasies, such as sometimes returning keys in the JSON and sometimes not.

```sh
archivebox add --parser pocket_api pocket://my_username
```

This gave me the following error:
```
  File "/app/archivebox/parsers/pocket_api.py", line 54, in link_from_article
    title = article['resolved_title'] or article['given_title'] or url
KeyError: 'resolved_title'
```

This commit contains the changes I made to successfully import my library.
2021-09-07 21:53:36 -03:00
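The `KeyError` in the commit above comes from indexing a dict with keys the Pocket API may omit. A minimal sketch of the usual remedy, using `dict.get()`, is shown here. The function name `title_from_article` is hypothetical and this is not necessarily the code the commit actually changed.

```python
def title_from_article(article: dict, url: str) -> str:
    """Return the first non-empty title field, falling back to the URL.

    dict.get() returns None for a missing key instead of raising KeyError,
    so the `or` chain keeps working when the API omits a field.
    """
    return article.get('resolved_title') or article.get('given_title') or url


# An article with no title keys at all falls back to the URL.
print(title_from_article({}, 'https://example.com'))
# An empty resolved_title is skipped in favour of given_title.
print(title_from_article({'resolved_title': '', 'given_title': 'Saved page'}, 'https://example.com'))
```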
Nick Sweeting
63693bdc77
add nix install instructions, thanks @siraben! 2021-08-11 11:12:58 -04:00
Nick Sweeting
2e5937d9e1
Merge pull request #822 from overhacked/bug_pinboard_rss
Fix Pinboard RSS parsing valid links as `None`
2021-08-04 12:23:40 -04:00
Ross Williams
f6cf35a45d Fix Pinboard RSS parsing valid links as None
`item.find(p)` returns either an `ElementTree.Element` or `None`.  The
[lambda on line 24][lambda] coerces the return value to a bool, which is
`False` if the `<link>` element has no children (see
[`ElementTree.py` line 207][etbooldef]), so the lambda returns `None`.

Further, returning a `Link` with `url=None` violates
[an assertion in `index/schema.py`][assertion], which crashes
the `archivebox add` command.

[lambda]: 3d54b1321b/archivebox/parsers/pinboard_rss.py (L24)
[etbooldef]: 3d8993a744/Lib/xml/etree/ElementTree.py (L207)
[assertion]: 3d54b1321b/archivebox/index/schema.py (L165)
2021-08-04 10:13:37 -04:00
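The pitfall described in the commit above can be reproduced with the standard library alone: an `Element` that exists but has no children has length zero, so a truthiness test treats it like a missing element. The sketch below compares against `None` explicitly instead. The helper name `text_of` and the sample XML are illustrative assumptions, not the parser's actual code.

```python
import xml.etree.ElementTree as ET

item = ET.fromstring('<item><link>https://example.com</link></item>')
link = item.find('link')

# The <link> element was found, but it has no child elements,
# which is why a bare truthiness check on it is unreliable.
print(link is not None, len(link))


def text_of(item, tag):
    """Return the stripped text of a child element, or None if absent or empty."""
    found = item.find(tag)
    # Compare against None explicitly rather than relying on truthiness.
    if found is not None and found.text:
        return found.text.strip()
    return None


print(text_of(item, 'link'))
print(text_of(item, 'missing'))
```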
Nick Sweeting
3d54b1321b
Merge pull request #804 from munzevibirinsan/patch-1 2021-07-21 08:28:08 -04:00
munzevibirinsan
ee86eb534d
Update setup.sh 2021-07-21 13:24:31 +03:00
Nick Sweeting
2427e6d3dc
Merge pull request #799 from Inndy/dev
Discard Referer header from iframe and link to original URL
2021-07-19 10:16:52 -04:00
Inndy
0e81a0722e Discard Referer header from iframe and link to original URL 2021-07-19 21:48:01 +08:00
Nick Sweeting
ca819b8015
Update README.md 2021-07-07 06:55:07 -04:00
Nick Sweeting
9100ad63a7
Merge pull request #784 from ArchiveBox/dependabot/npm_and_yarn/ws-5.2.3
Bump ws from 5.2.2 to 5.2.3
2021-07-07 00:44:33 -04:00
dependabot[bot]
1084f6aacc
Bump ws from 5.2.2 to 5.2.3
Bumps [ws](https://github.com/websockets/ws) from 5.2.2 to 5.2.3.
- [Release notes](https://github.com/websockets/ws/releases)
- [Commits](https://github.com/websockets/ws/compare/5.2.2...5.2.3)

---
updated-dependencies:
- dependency-name: ws
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2021-07-07 03:42:46 +00:00
Nick Sweeting
767e32c08c bump version number 2021-07-06 23:42:05 -04:00
Nick Sweeting
5a2c78e14b add proper support for URL_WHITELIST instead of using negation regexes 2021-07-06 23:42:00 -04:00
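As a rough illustration of why a dedicated allowlist setting can be simpler than encoding "everything except X" as a negation regex, the sketch below checks a URL against an optional deny pattern and an optional allow pattern. The function `url_allowed` and its parameters are hypothetical. How ArchiveBox actually applies `URL_WHITELIST` is not shown in this log.

```python
import re
from typing import Optional


def url_allowed(url: str,
                allow: Optional[re.Pattern] = None,
                deny: Optional[re.Pattern] = None) -> bool:
    """Return True if the URL passes both the deny and the allow pattern."""
    # A match on the deny pattern always rejects the URL.
    if deny is not None and deny.search(url):
        return False
    # When an allow pattern is set, the URL must match it to be accepted.
    if allow is not None and not allow.search(url):
        return False
    return True


allow = re.compile(r'^https?://(www\.)?example\.com/')
deny = re.compile(r'\.exe$')

print(url_allowed('https://example.com/page', allow=allow, deny=deny))
print(url_allowed('https://other.org/page', allow=allow, deny=deny))
```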
Nick Sweeting
e4974d3536 support negation patterns by checking both re.search and re.match 2021-07-06 23:17:05 -04:00
Nick Sweeting
65d452fe7b
add documentation for contributing a new extractor 2021-07-01 22:48:39 -04:00
Nick Sweeting
f6aad7f9a3
Merge pull request #777 from tjhorner/exempt-add-from-csrf
Exempt /add route from CSRF
2021-07-01 21:40:10 -04:00
TJ Horner
cdcfb7fa44 Exempt /add route from CSRF 2021-07-01 20:55:51 -04:00
Nick Sweeting
66f17cb46b
Merge pull request #766 from Namdrib/dev
Docs: Improve spelling on readme
2021-06-10 21:01:44 -04:00
Namdrib
6cffe7fa0f Docs: Improve spelling on readme 2021-06-07 21:36:28 +09:30
Nick Sweeting
e0a2b2e252 ominous warnings 2021-06-01 03:03:42 -04:00
Nick Sweeting
aa53fe653c fix use of unneeded perms arg 2021-06-01 02:58:36 -04:00
Nick Sweeting
c2d1a57581 fix umask dir permissions 2021-06-01 00:50:18 -04:00
Nick Sweeting
4715ace7dd ignore BaseException lgtm errors 2021-05-31 20:59:05 -04:00
Nick Sweeting
9f1470cf03 fix output permissions tests 2021-05-31 20:57:46 -04:00
Nick Sweeting
8230f88d80 change default OUTPUT_PERMISSIONS to disallow execution except on dirs 2021-05-31 19:31:51 -04:00
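One way to read "disallow execution except on dirs" is that files get a mode without execute bits, while directories get the same mode plus the execute (search) bit for every class that can read. The sketch below assumes a file mode of `0o644` and uses the hypothetical helpers `dir_mode_for` and `apply_permissions`. It is not taken from the ArchiveBox source and assumes a tree of regular files and directories only.

```python
import os
import stat
import tempfile


def dir_mode_for(file_mode: int) -> int:
    """Add an execute bit wherever the file mode grants read (0o644 becomes 0o755)."""
    return file_mode | ((file_mode & 0o444) >> 2)


def apply_permissions(root: str, file_mode: int = 0o644) -> None:
    """Set file_mode on every file and the derived mode on every directory under root."""
    dmode = dir_mode_for(file_mode)
    for dirpath, _dirnames, filenames in os.walk(root):
        os.chmod(dirpath, dmode)
        for name in filenames:
            os.chmod(os.path.join(dirpath, name), file_mode)


# Usage: build a small tree in a temporary directory and normalise it.
with tempfile.TemporaryDirectory() as root:
    sub = os.path.join(root, 'archive')
    os.mkdir(sub)
    path = os.path.join(sub, 'index.html')
    with open(path, 'w') as f:
        f.write('<html></html>')
    apply_permissions(root)
    file_result = stat.S_IMODE(os.stat(path).st_mode)
    dir_result = stat.S_IMODE(os.stat(sub).st_mode)
    print(oct(file_result), oct(dir_result))
```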
Nick Sweeting
1112526543 add option ENFORCE_ATOMIC_WRITES to allow disabling forced FSYNC writes on network drives 2021-05-31 19:31:51 -04:00
Nick Sweeting
e529f550d8
Update README.md 2021-05-13 05:34:03 -04:00
Nick Sweeting
ecc178b87f
Update README.md 2021-05-13 05:31:40 -04:00
Nick Sweeting
9dbc9d64fb
Update README.md 2021-05-13 05:27:48 -04:00
Nick Sweeting
11a7c4ef11
Update README.md 2021-05-13 05:24:36 -04:00
Nick Sweeting
a7fb8556ff
Update README.md 2021-05-13 05:21:49 -04:00
Nick Sweeting
decd18eea0
Update README.md 2021-05-13 05:14:39 -04:00
Nick Sweeting
254ff41b1a
Update README.md 2021-05-13 05:11:54 -04:00
Nick Sweeting
49faec8f6d
add no-zygote and single-process args to try and prevent orphan chrome processes after exit 2021-05-13 05:04:23 -04:00
Nick Sweeting
930fb394c3
Update README.md 2021-05-13 04:19:22 -04:00
Nick Sweeting
64a9200ac9
Update README.md 2021-05-13 00:55:35 -04:00
Nick Sweeting
d8c1be3f06
Update README.md 2021-05-13 00:43:29 -04:00
Nick Sweeting
b512b372ba
Update README.md 2021-05-13 00:39:27 -04:00
Nick Sweeting
4187afdebc
Update README.md 2021-05-13 00:37:18 -04:00
Nick Sweeting
c63fe0250e
Update README.md 2021-05-13 00:30:12 -04:00
Nick Sweeting
eb4d3bca9d
Update readability.py 2021-05-13 00:13:32 -04:00
Nick Sweeting
8504d47984
Update README.md 2021-05-12 23:37:19 -04:00
Nick Sweeting
76f29d0325 fix merge 2021-05-12 23:35:57 -04:00
Nick Sweeting
54a11d7927 add python dependencies 2021-05-12 23:35:25 -04:00
Nick Sweeting
a8c1ffc5ba
Update README.md 2021-05-07 09:43:50 -04:00
Nick Sweeting
d43ce963de
Update README.md 2021-05-07 09:43:30 -04:00
Nick Sweeting
cac13964cb
Update README.md 2021-05-07 09:30:44 -04:00
Nick Sweeting
987915e5ea
Update README.md 2021-05-07 09:27:42 -04:00