Saving, converting, and archiving files/websites
How to easily save a page as a PDF, convert a file, or archive a website (online, locally, or by saving each page as a single HTML file).
Save a page as a PDF
You can save a page as a PDF from the browser's print window (CTRL+P)
- choose "Save as PDF"
- you may add/remove the headers in the options
- if you want to remove something else from the page, do this before pressing CTRL+P
  - open the console (developer tools)
  - go to Elements
  - press CTRL+SHIFT+C (or use the selection icon)
  - select the element you want to remove
  - delete the HTML code that got highlighted in Elements
  - then go back to the print window and save as PDF
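When a page needs no manual cleanup, the print step above can also be scripted. This is only a minimal sketch, assuming a Chromium-based browser is installed and reachable as `chromium` (the binary name varies between systems); `page_to_pdf` and the `PDF_BROWSER` variable are hypothetical names chosen for this example.

```shell
# Hypothetical helper: print a web page to a PDF file without opening a window.
# Assumes a Chromium-based browser; set PDF_BROWSER if the binary has another name.
page_to_pdf() {
    # $1 = page URL, $2 = output file (defaults to page.pdf)
    "${PDF_BROWSER:-chromium}" --headless --print-to-pdf="${2:-page.pdf}" "$1"
}

# Usage (not run here):
#   page_to_pdf https://example.com example.pdf
```

Elements removed by hand in the developer tools are not part of this flow, so it only suits pages that already print cleanly.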
To convert a file, simply use your favorite search engine (DuckDuckGo, Google, Qwant, Ecosia, ...)
- search "xxx to yyy" to convert xxx to yyy
  - like "PNG to PDF"
- the same works for other operations
  - "merge PDF"
  - "compress PDF"
Some links
- https://www.ilovepdf.com/ (a lot of PDF tools, great)
- https://png2pdf.com/ (not many tools but the best)
- https://www.img2go.com/ (not tested, a batch of tools)
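If you would rather not upload a file, the three searches above ("PNG to PDF", "merge PDF", "compress PDF") have command-line equivalents. The sketch below is not something the notes above tested; it assumes ImageMagick 7 (`magick`), poppler-utils (`pdfunite`) and Ghostscript (`gs`) are installed, and all function names are hypothetical.

```shell
# "xxx to yyy": swap the extension of a file name, e.g. photo.png -> photo.pdf
with_ext() {
    printf '%s.%s\n' "${1%.*}" "$2"
}

# "PNG to PDF" (assumes ImageMagick 7)
png_to_pdf() {
    magick "$1" "$(with_ext "$1" pdf)"
}

# "merge PDF": pdfunite takes the input files first and the output file last
merge_pdf() {
    pdfunite "$@"
}

# "compress PDF": rewrite $1 into $2 with Ghostscript's /ebook quality preset
compress_pdf() {
    gs -sDEVICE=pdfwrite -dPDFSETTINGS=/ebook -dNOPAUSE -dBATCH -dQUIET \
       -sOutputFile="$2" "$1"
}

# Usage (not run here):
#   png_to_pdf scan.png
#   merge_pdf part1.pdf part2.pdf merged.pdf
#   compress_pdf merged.pdf merged-small.pdf
```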
To archive a website, the most well-known tool is https://archive.org/web/, which lets you archive a website one page at a time, among a lot of other things.
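Archiving one page at a time can be done from a terminal as well. A minimal sketch, assuming the public "Save Page Now" address (`https://web.archive.org/save/<url>`) and the availability API (`https://archive.org/wayback/available`) behave as usually described; the function names are hypothetical, and the page URL is passed through unencoded, which is only fine for simple addresses.

```shell
# Address that asks the Wayback Machine to capture a single page
wayback_save_url() {
    printf 'https://web.archive.org/save/%s\n' "$1"
}

# Address of the availability API, which reports the closest existing snapshot as JSON
wayback_check_url() {
    printf 'https://archive.org/wayback/available?url=%s\n' "$1"
}

# Request a capture; -f makes curl fail on HTTP errors, -L follows redirects
wayback_save() {
    curl -fsSL -o /dev/null "$(wayback_save_url "$1")"
}

# Usage (not run here, needs network access and may be slow or rate-limited):
#   wayback_save https://example.com/page
#   curl -s "$(wayback_check_url example.com)"
```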
You may also install an archive manager locally, if you feel like it, such as ArchiveBox
- on Windows (auto-script)
  - you can run it inside Git Bash (installed by git-scm)
    curl -sSL 'https://get.archivebox.io' | sh
  - but you must have docker and python+pip installed
  - (it didn't work for me)
- on Windows (WSL - Ubuntu)
  - I followed the official tutorial
  - but before starting
    sudo apt-get install python3
    sudo apt-get install python3-pip
  - and if you run into a problem (infinite fetching)
    - press CTRL+C (exit), then run
    sudo npm install -g yarn
    yarn install (in the archivebox directory)
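Both install routes above depend on other tools already being present, so it can save time to check for them first. A small sketch; `missing_tools` is a hypothetical helper, and the exact command names are assumptions (pip, for instance, may be `pip` or `pip3` depending on the system).

```shell
# Print the name of every listed tool that is not found on PATH
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Usage (not run here): prints nothing when everything is installed
#   missing_tools docker python3 pip3 npm yarn
```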
You can run the web server with
    archivebox server 0.0.0.0:8000
and open http://127.0.0.1:8000/ in your browser.
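The web server only has something to show once a collection exists and holds at least one page. A minimal sketch of that first step, assuming the `archivebox` command is on PATH and that `archivebox init` and `archivebox add` behave as in the official documentation; `archive_one`, the `ARCHIVEBOX` variable and the directory name are hypothetical.

```shell
# Hypothetical helper: archive one URL into a local ArchiveBox collection.
# The body runs in a subshell, so the caller's working directory is unchanged.
archive_one() (
    # $1 = data directory (arbitrary example name), $2 = URL to archive
    mkdir -p "$1" && cd "$1" || exit 1
    "${ARCHIVEBOX:-archivebox}" init       # create or update the collection here
    "${ARCHIVEBOX:-archivebox}" add "$2"   # snapshot a single URL
)

# Usage (not run here):
#   archive_one ~/archivebox-data https://example.com
#   cd ~/archivebox-data && archivebox server 0.0.0.0:8000
```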
I first tried archive.org, but it's slow, and I was looking for a file rather than an online copy. I then tried ArchiveBox, but this was still not what I wanted (it takes too long to archive and is too tiring to use).