mirror of
https://github.com/ArchiveBox/ArchiveBox
synced 2024-11-22 12:13:05 +00:00
Update README.md
This commit is contained in:
parent
2c51430a31
commit
dd1216546c
1 changed files with 3 additions and 3 deletions
|
@ -38,13 +38,13 @@ Without active preservation effort, everything on the internet eventually dissap
|
||||||
**It saves snapshots of the URLs you feed it in several redundant formats.**
|
**It saves snapshots of the URLs you feed it in several redundant formats.**
|
||||||
It also detects any content featured *inside* pages & extracts it out into a folder:
|
It also detects any content featured *inside* pages & extracts it out into a folder:
|
||||||
- 🌐 **HTML**/**Any websites** ➡️ `original HTML+CSS+JS`, `singlefile HTML`, `screenshot PNG`, `PDF`, `WARC`, `title`, `article text`, `favicon`, `headers`, ...
|
- 🌐 **HTML**/**Any websites** ➡️ `original HTML+CSS+JS`, `singlefile HTML`, `screenshot PNG`, `PDF`, `WARC`, `title`, `article text`, `favicon`, `headers`, ...
|
||||||
- 🎥 **Social Media**/**News** ➡️ `post content TXT`, `comments`, `title`, `author`, `images`
|
- 🎥 **Social Media**/**News** ➡️ `post content TXT`, `comments`, `title`, `author`, `images`, ...
|
||||||
- 🎬 **YouTube**/**SoundCloud**/etc. ➡️ `MP3/MP4`s, `subtitles`, `metadata`, `thumbnail`, ...
|
- 🎬 **YouTube**/**SoundCloud**/etc. ➡️ `MP3/MP4`s, `subtitles`, `metadata`, `thumbnail`, ...
|
||||||
- 💾 **Github**/**Gitlab**/etc. links ➡️ `clone of GIT source code`, `README`, `images`, ...
|
- 💾 **Github**/**Gitlab**/etc. links ➡️ `clone of GIT source code`, `README`, `images`, ...
|
||||||
- ✨ *and more, see [Output Formats](#output-formats) below...*
|
- ✨ *and more, see [Output Formats](#output-formats) below...*
|
||||||
|
|
||||||
It uses [standard tools](#dependencies) like Chrome, `wget`, & `yt-dlp`, and stores data in ordinary [files & folders](#archive-layout).
|
It uses [standard tools](#dependencies) like Chrome, `wget`, & `yt-dlp`, and stores data in [ordinary files & folders](#archive-layout) and SQLite3.
|
||||||
*(no complex proprietary formats)*
|
*(no complex proprietary formats, accessible decades in the future without needing to run ArchiveBox)*
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
|
|
Loading…
Reference in a new issue