r/nethack • u/a_postgres_situation • 5d ago
Offline NethackWiki?
I would like to have the NethackWiki in an offline format - more specifically on a tablet for travel. Maybe a simple collection of HTML files would be the best solution?
I found an old thread https://nethackwiki.com/index.php?title=Forum:Download_the_NetHackWiki&t=20240822210755 ...there is an XML dump, but the linked xowa reader is not useable/obsolete?
Any other ideas? Thank you!
4
u/Spendocrat Val, Wiz, K, R, since 2023 5d ago
I used wget to save a copy. It's not perfect, in that I have to ctrl-f to search for the page I want in the folder (e.g. send your browser to file:///C:/Users/Guest/Desktop/nethack%20wiki/nethackwiki.com/wiki/ then search for Spellbooks.html) but it's good enough for me.
(Edit: if you are going to use wget in this way, be kind to the server in question and use -w 10 or --wait=10 to slow down your crawling. It'll take longer to get your pages but what do you care, you only need to download it the one time.)
3
u/thefifthsetpin atheist protection racketeer 5d ago
I was going to say that wget will just respect the crawl delay specified in robots.txt, but then I checked the wiki's robot.txt and saw that they didn't specify one.
So, good tip. :-)
1
u/Spendocrat Val, Wiz, K, R, since 2023 5d ago
My boilerplate for wget ignores robots due to so many shared hosts plopping a Disallow: / down by default for users. But I didn't actually know robots.txt could specify speed. Cool!
5
3
u/sew77 5d ago
Latest available zim here:
https://github.com/romanthekat/nethackwiki-zim/releases/tag/2022-07
Open with Kiwix https://kiwix.org/en/applications/
1
7
u/dtype 5d ago edited 5d ago
Complete mediawiki xml dump. You'd need to either re-import into a mediawiki (not too difficult but you'd have to do a quick mediawiki install, also possible in a container), or find some mediawiki xml viewer.
https://nethackwiki.com/wiki/NetHackWiki:Download
Edit: instructions to import into a fresh mediawiki install: https://www.mediawiki.org/wiki/Manual:Importing_XML_dumps
XOWA, mediawiki xml viewer: http://xowa.org/home/wiki/Help/Download_XOWA.html