Using REGEXP_SUBSTR to extract website domain

Question

I have a field called Website with examples that look like: I am trying to use REGEXP_SUBSTR to isolate the domain: REGEXP_SUBSTR(&#8220;Website&#8221;, &#8216;[^https://]+&#8217;) Some of the results are working but others are not, for instance I am expecting cornstalk.com and penny.co but I am not receiving…

Accepted Answer

You can useSELECT REGEXP_SUBSTR("Website", '^(https?://)?(.*)', 1, 1, 'e', 2)Details:^ &#8211; start of string(https?://)? &#8211; an optional Group 1: http:// or https://(.*) &#8211; Group 2: the rest of the string.The last argument, together with e last but one argument, returns the Group 2 value.However, REGEXP_REPLACE might be better here:SELECT REGEXP_REPLACE("Website", '^https?://', '')That is, just remove the http:// or https:// from the start of a string.

Advertisement

Answer