Remove duplicate rows from a big table

Question

I've got data from a third party and imported it into SQL Server. The table has 255,072,636 records, of which 61,714,772 are unique. The table has no particular order and no index. It has 4 columns: Field1 (float), Field2 (varchar(255)), Field3 (varchar(255)), Field4 (varchar(255)). I want to delete the duplicate records based on Field1, for which I've run the following query: but it

Accepted Answer

Thanks to “Kazi Mohammad Ali Nur” and “eshirvana”. I've combined their solutions. First I created a clustered index on Field1:

    CREATE CLUSTERED INDEX Index_Name
        ON MyTable(Field1);

Then I executed the following query to insert the unique records into a new table, and deleted the original table:

    WITH CTE(Field1, Field2, Field3, Field4, DuplicateCount)
    AS (SELECT *,
               ROW_NUMBER() OVER(PARTITION BY Field1 ORDER BY Field1) AS DuplicateCount
        FROM MyTable)
    SELECT * INTO TempTable
    FROM CTE
    WHERE DuplicateCount = 1;

It worked. Thanks to all.
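The answer does not show the cleanup after the copy. What follows is a minimal sketch of how that step might look, assuming the goal is to end up with the de-duplicated data under the original name MyTable. The sanity check, the column drop, and the rename are assumptions added for illustration and are not part of the original answer.

```sql
-- Sanity check (assumption): the copy should hold one row per distinct Field1.
-- COUNT(DISTINCT ...) ignores NULLs, while ROW_NUMBER() keeps one row for the
-- NULL group, so the two numbers can differ by 1 if Field1 contains NULLs.
SELECT COUNT(*) AS RowsInCopy FROM TempTable;
SELECT COUNT(DISTINCT Field1) AS DistinctKeys FROM MyTable;

-- SELECT * INTO also copied the helper column from the CTE; drop it.
ALTER TABLE TempTable DROP COLUMN DuplicateCount;

-- Replace the original table with the de-duplicated copy.
DROP TABLE MyTable;
EXEC sp_rename 'TempTable', 'MyTable';
```

Note that SELECT ... INTO creates the new table without any indexes, so the clustered index on Field1 would have to be created again on the copy if it is still wanted.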

Answer