Question 1

Will hyperlinks be preserved?

Accepted Answer

Link text is preserved as-is. The URLs are dropped. For plain text, link destinations aren't representable inline.

Question 2

What about images and media?

Accepted Answer

Images are removed. Their `alt` text isn't currently substituted into the output, since alt text is often missing or low-quality.

Question 3

Does the order match how the page reads visually?

Accepted Answer

It matches the DOM (source) order, which usually matches reading order. Pages relying on CSS for layout (e.g., reordered grids) may produce surprising sequences.

Convert HTML to TXT

About this conversion

When this conversion is useful

Quality and tradeoffs

Frequently asked questions