thumbnail of n.png
thumbnail of n.png
n png
(202.9 KB, 500x500)
thumbnail of 456517.png
thumbnail of 456517.png
456517 png
(26.65 KB, 752x592)
Topics * further usage of Interplanetary Wayback Machine (ipwb) * canterlot.com/forum * ipwb should only be ran in a private testing environment?

If you have web raws with a correct timestamp of when it was downloaded then you can put it into Interplanetary Wayback Machine: no WARC needed! This is borderline faking WARCs, so proceed with caution. This worked (zero-byte header, non-empty payload):
> org,ourboard,tvshowtranscripts)/viewtopic.php?f=303&t=19507 20241104011910 {"locator": "urn:ipfs/bafkreihdwdcefgh4dqkjv67uzcmw7ojee6xedzdetojuzjevtenxquvyku/bafkreiguqle7qr3p7qpud3li6nefwnhpsbicigrmdeyxgfdwu4l6v4d7wi", "status_code": "200", "mime_type": "text/html;charset=UTF-8", "original_uri": "https://tvshowtranscripts.ourboard.org/viewtopic.php?f=303&t=19507", "title": "s01e04 - eps1.3_da3m0ns.mp4 - Mr. Robot Transcripts - TvT"}
Added to CDXJ (.txt):
> $ ipwb -d /dns/10.0.0.222/tcp/5001/http replay bafkreicddpbk7q3kgranv3kc-edit.txt

I learned that the CDXJ must be sorted to work, so if you have captures from .net, .biz, .com, and .horse URLs (for example) then you can't just stick that line of text anywhere and have it all work. In vim, run ":sort". Proof that it works (with URLs containing "?"!):
> https://archive.is/2024.12.07-135938/http://ponypalsh4y6olziyjlswfv674utokqhz3y6beym2erqtstcgadmacid.onion:2016/memento/20241104011910/tvshowtranscripts.ourboard.org/viewtopic.php?f=303&t=19507

A capture of an MLP-related forums webpage:
> https://archive.is/2024.12.07-140221/http://ponypalsh4y6olziyjlswfv674utokqhz3y6beym2erqtstcgadmacid.onion:2016/memento/20240311040324/canterlot.com/forum/215-last-post-wins-formerly-spam-stables/

If you run Interplanetary Wayback Machine (ipwb) publicly for long enough, you might get raped/hacked or have something else bad or unwanted or unexpected happen. I think it's also based on Python. Not great to run Python-based web servers, so I've heard.

 >>/11392/
v2:  >>/11394/ (cross-thread)

Image: I hope that TinEye and tvshowtranscr1pts.0urboard.0rg both get breached and have all of their data leaked for free. Second image seems poorly drawn.