Use go-readability to remove noise from webpage --> Download all the images and change their refs to point to local images --> Add the content to epub structure created using go-epub
Hi, earlier I created it as a wrapper using calibre and percollate. It was not portable at all.
This weekend, re-wrote it in golang to use go-readability for converting HTML to readable content and then build epub from there.
Now, binary can be used directly without worrying about extra dependencies.
Name is kindle-send but it is just pushing epubs via email. So shouldn't be a problem if you're using a different reader that supports receiving ebooks via email.
I read lot of blogs, made something similar [0] a while ago.
Write now installing it is painful, if you don't have calibre or node js installed.
Re-writing it all in nodejs, so that installation is easier. Calibre is not needed now, as amazon started converting epubs to azw3.
I use substack for sharing links in form of newsletter. I like the platform, it is simple, does one thing well, has a full content RSS feed, editor could use markdown support. Buttondown is better, but I'm currently not planning on paying anything for a while so not switching.
As a reader, I like how I can find relevant blogs to read without subscribing and no doubt it is better than medium in every aspect.
Now I use it to send blogs, books and sometimes send whole archives of a website (you can use it in scripts).
You can export Kindle highlights to Obsidian, so one benefit of making these epubs is how you accumulate the highlights at one place.
Although, name is kindle-send but it can send to any ereader that uses email as a mechanism to send books.
[0] https://github.com/nikhil1raghav/kindle-send