I have a column called name in a table in Databricks.
I want to find a way to select only those rows from the table that contain at least one alphabetic character in the name column.
Example values in the column:
12243 #123-(23) $ank ada124$% () !asd 122acs# gmgd32
Expected: I need to pick only those values which contain at least one alphabetic character. In other words, I need a way to exclude all the rows that contain only numbers and special characters.
So the expected output should be as below:
$ank ada124$% !asd 122acs# gmgd32
because these values each contain at least one alphabetic character.
I am using PySpark SQL in Databricks.
Answer
You can use rlike with a regex:
import pyspark.sql.functions as F
# keep only rows whose name contains at least one letter (a-z or A-Z)
df.filter(F.col("name").rlike(".*[a-zA-Z]+.*")).show()
#+--------+
#| name|
#+--------+
#| $ank|
#|ada124$%|
#| !asd|
#| 122acs#|
#| gmgd32|
#+--------+
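Since rlike returns true if the pattern matches anywhere in the string, the leading and trailing .* are not strictly required; rlike("[a-zA-Z]") behaves the same way. For reference, here is a minimal, self-contained sketch of the same filter, assuming each sample value from the question is a separate row in a single-column DataFrame named df:
from pyspark.sql import SparkSession
import pyspark.sql.functions as F

spark = SparkSession.builder.getOrCreate()
# assumed reconstruction of the sample data: one value per row in a "name" column
values = ["12243", "#123-(23)", "$ank", "ada124$%", "()", "!asd", "122acs#", "gmgd32"]
df = spark.createDataFrame([(v,) for v in values], ["name"])
# keep rows containing at least one letter
df.filter(F.col("name").rlike(".*[a-zA-Z]+.*")).show()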
Spark SQL equivalent query:
SELECT * FROM df WHERE name RLIKE '.*[a-zA-Z]+.*'
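To run that SQL from PySpark in a Databricks notebook, one option (a sketch, assuming your DataFrame is named df) is to register it as a temporary view and query the view:
df.createOrReplaceTempView("df")
spark.sql("SELECT * FROM df WHERE name RLIKE '.*[a-zA-Z]+.*'").show()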