
Possible memory leak in simple batch file processing function in C#

I’m running a very simple function that reads lines from a text file in batches. Each line contains an SQL query, so the function grabs a specified number of queries, executes them against the SQL database, then grabs the next batch until the entire file has been read. The problem is that with very large files the process slows down considerably over time. I’m guessing there is a memory leak somewhere in the function, but I can’t determine where it might be. Nothing else is going on while this function runs. My programming skills are crude at best, so please go easy on me. 🙂

    for (int x = 0; x <= totalBatchesInt; x++)
    {
        var lines = System.IO.File.ReadLines(file).Skip(skipCount).Take(batchSize).ToArray();
        string test = string.Join("\n", lines);
        SqlCommand cmd = new SqlCommand(test.ToString());
        try
        {
            var rowsEffected = qm.ExecuteNonQuery(CommandType.Text, cmd.CommandText, 6000, true);
            totalRowsEffected = totalRowsEffected + rowsEffected;
            globalRecordCounter = globalRecordCounter + rowsEffected;
            fileRecordCounter = fileRecordCounter + rowsEffected;
            skipCount = skipCount + batchSize;
            TraceSource.TraceEvent(TraceEventType.Information, (int)ProcessEvents.Starting, "Rows progress for " + folderName + "_" + fileName + " : " + fileRecordCounter.ToString() + " of " + linesCount + " records");
        }
        catch (Exception esql)
        {
            TraceSource.TraceEvent(TraceEventType.Information, (int)ProcessEvents.Cancelling, "Error processing file " + folderName + "_" + fileName + " : " + esql.Message.ToString() + ". Aborting file read");
        }
    }


Answer

There are many things wrong with your code:

  1. You never dispose your command. SqlCommand holds on to unmanaged database resources; waiting for the garbage collector to clean them up is very bad practice.

  2. You shouldn’t be sending those commands individually anyway. Either send them all at once in one command, or use transactions to group them together (see the sketch after this list).

  3. This one is the reason it gets slower over time: File.ReadLines(file).Skip(skipCount).Take(batchSize) re-reads the file from the beginning on every iteration and skips an ever-growing number of already-processed lines, so each batch takes longer than the one before it.
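
For #1 and #2, here is a minimal sketch. It assumes direct ADO.NET access (SqlConnection/SqlCommand/SqlTransaction) rather than your qm helper, a hypothetical connectionString, and that the 6000 you pass to qm.ExecuteNonQuery is a command timeout; adapt it to whatever qm actually wraps. Each batch is sent as a single command inside a transaction, and everything is disposed deterministically:

using System.Collections.Generic;
using System.Data.SqlClient;

static int ExecuteBatch(string connectionString, IEnumerable<string> batchLines)
{
    // One connection and transaction per batch; all three objects are disposed when the method exits.
    using var connection = new SqlConnection(connectionString);
    connection.Open();
    using var transaction = connection.BeginTransaction();
    using var cmd = new SqlCommand(string.Join("\n", batchLines), connection, transaction);
    cmd.CommandTimeout = 6000; // assumed to match the 6000 passed to qm.ExecuteNonQuery
    var rowsAffected = cmd.ExecuteNonQuery();
    transaction.Commit();
    return rowsAffected;
}

Call it once per batch from the reading loop and accumulate your counters from the return value.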

To fix #3, simply create the enumerator once and iterate it in batches. In pure C#, you can do something like:

using var enumerator = File.ReadLines(file).GetEnumerator();

for (int x = 0; x <= totalBatchesInt; x++)
{
    var lines = new List<string>();
    // Check the count before advancing so no line is read and then thrown away.
    while (lines.Count < batchSize && enumerator.MoveNext())
        lines.Add(enumerator.Current);
    string test = string.Join("\n", lines);
    // your code...
}

Or if you’re using MoreLINQ (which I recommend), something like this:

foreach(var lines in File.ReadLines(file).Batch(batchSize))
{
    // your code...
}
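
Batch here is the MoreLINQ extension (the morelinq NuGet package, with using MoreLinq;). If you would rather avoid the extra dependency and are on .NET 6 or later, the built-in Enumerable.Chunk gives the same pattern:

foreach (var lines in File.ReadLines(file).Chunk(batchSize))
{
    // your code...
}
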
User contributions licensed under: CC BY-SA