REGEXP_REPLACE Strings Starting and Ending with Specific Substrings in Snowflake

Question

I am trying to create a column in a view in Snowflake that replaces any string between strings that I care about with nothing. This is essentially for the purpose of stripping html formatting out of text. As an example: Would should end up like this: Based on the patterns I am seeing, I think that if I can el…

Accepted Answer

Your regular expression works, but it requires lookarounds.set sample1 = '&lt;ul&gt;';set sample2 = '&lt;li&gt;Text I care about 1';set sample3 = '&lt;li&gt;Text I care about 2&lt;/li&gt;';set sample4 = '&lt;li&gt;Text I care about 3&lt;/li&gt;';set sample5 = '&lt;/ul&gt;';select regexp_replace2($SAMPLE1,'&lt.+?&gt;','');  select regexp_replace2($SAMPLE2,'&lt.+?&gt;','');select regexp_replace2($SAMPLE3,'&lt.+?&gt;','');select regexp_replace2($SAMPLE4,'&lt.+?&gt;','');select regexp_replace2($SAMPLE5,'&lt.+?&gt;','');I wrote a UDF library that supports regular expression lookarounds. It attempts to approximate the built-in Snowflake regular expression functions while supporting lookarounds. The names of the UDFs are the same as the built-in regular expression functions with the suffix &#8220;2&#8221; as shown in the SQL sample.https://github.com/GregPavlik/SnowflakeUDFs/tree/main/RegularExpressions

Advertisement

Answer