Thank you for writing this package #29

Open · mpereira opened this issue Jan 2, 2020 · 14 comments


mpereira commented Jan 2, 2020

Hey @alphapapa,

Thanks a lot for putting your time into this package. It's crazy useful. org-web-tools-insert-web-page-as-entry is a staple in my workflow.

Feel free to close this. 🙂

alphapapa (Owner) commented:

Hi Murilo,

Thanks for the kind words. It's always encouraging to hear that it's useful to someone. Would you mind sharing a bit about your workflow?

I used to use that function in a capture template to capture Web pages to read later, but after my articles.org file grew to several megabytes in size, I decided to start saving pages' original HTML as attachments using the org-web-tools-archive commands instead. It is less searchable, of course, although I wrote some simple code in a branch to search the archives, and I've also found that Recoll works for indexing and searching them quickly.


mpereira commented Jan 6, 2020

> I used to use that function in a capture template to capture Web pages to read later

My workflow is exactly that. Maybe I don't capture as many pages as you do, and my articles.org isn't in the megabytes yet!

> I decided to start saving pages' original HTML as attachments using the org-web-tools-archive commands instead

"Archival" is a use case I have in mind as well. For example, I'm currently apartment hunting. The webpages for the apartments might be taken down by the owners, while I'd still like to go over the pictures, details, etc.

For that use case I've been using monolith from the shell. If I get some free time soon I'll work on an Emacs Lisp function to archive webpages as attachments to Org headings using monolith. It's also handy for caching webpages locally for offline reading, on flights for example.
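A rough sketch of what such a function might look like (the name my/org-monolith-attach is hypothetical; the only interfaces assumed are monolith's -o output flag, call-process, and the built-in org-attach):

(require 'org-attach)

;; Hypothetical helper, not part of org-web-tools; assumes monolith is on PATH.
(defun my/org-monolith-attach (url)
  "Save URL with monolith and attach the resulting HTML to the Org entry at point."
  (interactive "sURL: ")
  (let ((file (make-temp-file "monolith-" nil ".html")))
    ;; `monolith URL -o FILE' writes a single HTML file with all assets embedded.
    (unless (zerop (call-process "monolith" nil nil nil url "-o" file))
      (error "monolith failed for %s" url))
    ;; Move the file into the current heading's attachment directory.
    (org-attach-attach file nil 'mv)))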

alphapapa (Owner) commented:

Cool. Tools like Monolith are interesting, but storing all assets base64-encoded into the HTML has some serious drawbacks to me, so I'm sticking with zip/tar archives for now. Thanks for the feedback!


xvrdm commented Apr 12, 2020

Thanks for the great library!

Did you consider using something like SingleFile?


matiya commented Apr 12, 2020

I also want to say thank you for this! It has been a dramatic change to the way I consume websites.

Whenever I see a longish article on the web, I read it as an Org file to free myself from the distractions of a full browser. I never save the articles, as I know I won't read them later; instead, I start condensing the information by deleting the parts that don't interest me. Finally, I review what is left and possibly extract Anki cards from the content (via the amazing anki-editor) if I think it's something that might be useful in the future.


xvrdm commented Apr 13, 2020

That’s a very interesting alternative to copy-pasting parts of interest! Start from the whole thing and keep only what matters: thanks for sharing!

alphapapa (Owner) commented:

> Thanks for the great library!

Thanks.

> Did you consider using something like SingleFile?

I think I've seen it before. It appears to be only a browser extension, not something I could use from Emacs. If there were a way to run a shell command that caused SingleFileZ to be used to save a page to an archive, that might be useful. Of course, doing so within a browser (rather than using Wget or archive.today) raises issues of unwanted page content, ads, scripts, etc.

xuchunyang commented:

> a shell command that caused SingleFileZ to be used to save a page to an archive

The README mentions it has a command-line tool as well:
https://github.com/gildas-lormeau/SingleFile#command-line-interface

alphapapa (Owner) commented:

> a shell command that caused SingleFileZ to be used to save a page to an archive

> The README mentions it has a command-line tool as well:
> https://github.com/gildas-lormeau/SingleFile#command-line-interface

Thanks, that's interesting. The setup is a bit much, and it uses Node.js, so I think I'll pass. But it may be useful to others.


xvrdm commented Apr 14, 2020

Yes, I only suggested it because it does have a CLI, which I used with good results. But I understand if you aren’t too keen on the required stack.

I also pointed it out to the author of org-board, who might look into it.


elsatch commented May 31, 2020

Thank you for all the effort you've put into this package!

I've been following your steps, trying to use org-web-tools from org-capture. So far, I have not been successful: every time I call any of the functions from org-capture I get a (wrong-number-of-arguments) error.

I've tried using %c as a parameter and calling the functions interactively, but so far I haven't managed to get it working.

Do you have any org-capture template example available somewhere?

Thanks in advance!

alphapapa (Owner) commented:

@elsatch Here's one I use:

("cl" "Link to web page" entry
      (file+datetree "~/org/cpb.org")
      "* %(org-web-tools--org-link-for-url) :website:

%U %?" :clock-in t :clock-resume t :empty-lines 1)
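For anyone wiring this up from scratch, a sketch of how that template might be added to org-capture-templates (the "cl" key and the file path are just the examples from above, nothing special):

(with-eval-after-load 'org-capture
  ;; Add the template shown above to the list of capture templates.
  (add-to-list 'org-capture-templates
               '("cl" "Link to web page" entry
                 (file+datetree "~/org/cpb.org")
                 "* %(org-web-tools--org-link-for-url) :website:\n\n%U %?"
                 :clock-in t :clock-resume t :empty-lines 1)))

As I understand it, org-web-tools--org-link-for-url looks for the first URL in the kill-ring when called with no arguments, so copy the page's URL before invoking the capture template.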


elsatch commented Jun 2, 2020

Thank you so much @alphapapa !

Eason0210 commented:

Thank you very much for this package; it's very useful for me to convert websites to Org.

alphapapa added this to the N/A milestone on Dec 20, 2023