URL Extractor

Extract all HTTP and HTTPS URLs from text or HTML with deduplication and separator options.

About This Tool

The URL Extractor scans any block of text and extracts all URLs starting with http:// or https://. It handles full URLs, including paths, query parameters, and fragments, and strips trailing punctuation that is not part of the URL (such as a period or closing parenthesis at the end of a sentence).

Options include removing duplicate URLs and choosing the output separator. Paste raw HTML, Markdown, logs, or any text and the tool will extract the links. All processing runs locally in your browser.
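As a rough illustration of the approach described above, here is a minimal Python sketch of regex extraction followed by trailing-punctuation stripping. The tool itself runs in the browser and its actual pattern is not published here, so URL_PATTERN, TRAILING_PUNCTUATION, and extract_urls are assumptions chosen for illustration, not the tool's real code.

```python
import re

# Assumed pattern: the scheme, then any run of characters that are not
# whitespace, quotes, or angle brackets (so HTML attributes end the match).
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")

# Assumed set of characters treated as sentence punctuation when they
# appear at the very end of a match.
TRAILING_PUNCTUATION = ".,;:!?)]}"


def extract_urls(text: str) -> list[str]:
    """Return every http(s) URL found in text, in order of appearance."""
    urls = []
    for match in URL_PATTERN.finditer(text):
        # Drop punctuation that belongs to the sentence, not the URL.
        urls.append(match.group(0).rstrip(TRAILING_PUNCTUATION))
    return urls


sample = "Docs are at https://example.com/guide?page=2#intro. (See also http://example.org/a.)"
print(extract_urls(sample))
# ['https://example.com/guide?page=2#intro', 'http://example.org/a']
```

The query string and fragment survive because only characters at the end of the match are stripped.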

How to Use

Paste text containing URLs (HTML, Markdown, logs, or an email body) into the input field, or click Sample.
Toggle Remove duplicates to deduplicate identical URLs.
Choose an output separator (one per line is most useful for further processing).
Copy the extracted URL list.
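
The deduplication and separator steps above can be sketched in a few lines of Python. This is an illustrative assumption about how such options might work, not the tool's own code: format_urls and its parameters are hypothetical names, and "duplicate" here means an exact string match, with the first occurrence kept.

```python
def format_urls(urls, remove_duplicates=True, separator="\n"):
    """Join URLs with the chosen separator, optionally dropping repeats."""
    if remove_duplicates:
        # dict.fromkeys keeps the first occurrence of each URL and
        # preserves the original order (guaranteed since Python 3.7).
        urls = list(dict.fromkeys(urls))
    return separator.join(urls)


found = [
    "https://example.com/a",
    "https://example.com/b",
    "https://example.com/a",
]
print(format_urls(found))                       # one URL per line, two lines
print(format_urls(found, separator=", "))       # comma-separated
print(format_urls(found, remove_duplicates=False, separator=" "))
```

One URL per line is the easiest form to feed into other tools, since each line can be read as a single record.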

Use Cases

SEO analysts extract all URLs from a page's HTML to audit links. Developers pull webhook URLs from configuration exports or log files. Researchers extract references from academic papers pasted as text. QA engineers extract API endpoints from server logs for testing. Content managers audit outbound links in an article before publishing.

FAQ

Does it extract non-HTTP URLs (ftp://, mailto:)? No. Currently only http:// and https:// URLs are extracted, as these cover the vast majority of use cases.
Does it work with HTML href attributes? Yes. Paste raw HTML and the extractor will find URLs in href="...", src="...", and inline text.
What if a URL contains parentheses? Parentheses in the middle of a URL (as in many Wikipedia links) are kept. A parenthesis at the very end of a match is treated as trailing punctuation and stripped, so a URL that genuinely ends in ")" will lose that character.
Is there a size limit? There is no fixed limit. All processing is regex matching in your browser, so the practical ceiling is the memory and speed of your device.
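
The behaviors described in the FAQ can be checked with a small Python sketch. The pattern and the stripped character set below are assumptions made for illustration (the tool's real regex may differ), and extract_http_urls is a hypothetical name.

```python
import re


def extract_http_urls(text):
    """Find http(s) URLs and strip assumed trailing sentence punctuation."""
    # Quotes and angle brackets end a match, so href="..." and src="..."
    # values are captured without the surrounding markup.
    candidates = re.findall(r"https?://[^\s\"'<>]+", text)
    return [url.rstrip(".,;:!?)]}") for url in candidates]


html = (
    '<a href="https://example.com/page">Home</a> '
    '<img src="http://cdn.example.com/logo.png"> '
    'Mirror: ftp://files.example.com/archive.zip '
    '(background: https://en.wikipedia.org/wiki/Rust_(programming_language)#History) '
)

for url in extract_http_urls(html):
    print(url)
# https://example.com/page
# http://cdn.example.com/logo.png
# https://en.wikipedia.org/wiki/Rust_(programming_language)#History
```

The ftp:// link is skipped, the parentheses inside the Wikipedia URL are kept, and only the closing parenthesis that ends the surrounding sentence is removed.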